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PREFACE, 


The present course of Lectures on the Measurement of 
Groups and. Series deals with some of the most modern 
motliods of atatiatical research. Interesting as they were to 
those who had the advantage of hearing them delivered, 
they will doubtless, when studied at leisure in printed form, 
prove even moro interesting and useful. 

Tliose Lectures are the fifth of a Series originated in 
1897, designed for the assistance of Actuarial Students in 
connection with matters not included in tlie ofiioial Text 
Books. Tliree of tho Series deal with legal matters, and 
ono with the subject of Stock Exchange Seouritioa. The 
present course carries the range of topics into tho field of 
mathematics, and it is hoped that courses of lectures may 
be heroafter provided dealing with other subjects, practical 
and theoretical, relating to those branches of knowledge 
which it is the province of tho Institute of Actuaries to 
promote and encourage, 


W. H. 


2Aili J^ehruary, 1903. 
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Note. — I n the followintj lectures I have iiiiule Ireo use ol‘ 
statistical method b and formula?, which (though in many on sob of roooiit 
origin) mny now perhaps he regarded ns common property; but 1 hope 
that I have not inadvertently cpioted without reforoiico, or iniHqiiolod, 
investigations or theories, which may ho regarded as personal to any of tho 
small body of statisticians worltiiig on the .subject treated, Tho lootun's 
had to he prepared botli for delivery and for tho l^rcss at short uoLico in tin* 
midst of a hns 3 '' session. This fact innst ho mj’ apology’ for any ohsonroness, 
uimecesaaiy repetition, or eluinsineKs of armiigeincnt or expression, whiidi 
may be found. 



MEASUREMENT OE GROUPS. 


FIRST LECTURE. 


GtBNTLBMBN, ib wfis witli consiLloralilo diffidouco that I 
undertook to lectiiro to membora of the Society of Actuaries 
on a subject witli wliicli tliey may bo presumed to bo so 
I'amiliar. When I was asked if T could undertake these 
lectures I had some difficulty in choosing a suitable subject, 
and then ib oocurrod to me that my audience wore probably 
coiioorned with the practical aspects of a question which I 
was chioJly considering from the theoretical point of view, and 
that it would thcrofore bo most suitable if I endeavoured to 
lay boforo them somo theoretical considerations on subjects 
which did not come in their ordinary course, but which woro 
allied to the subjects which naturally come before them, and 
which are allied to those subjects on which I have spent a 
certain amount of iimo and attention. 

Groups. 

'Jfiio first subject which I have selected is the moasuremeiit 
of a group, the characteristics of a group, and its represouta- 
tioii. By a grou]! I undorsiand a number of persons or things 
oach of wliich possesses a measurable characteristic, the 
gi’ouj) being arranged aocording to the magnitude of tlie 
charactoristio. If or example, if I have returns of the wages 
of a largo number of people, and I group them according to 
thoir wages, saying how many are earning 20 s. to 25s., and 
so forth, I shall have such a group j or if I choose a section of 
tho population and group them according to ages, I should 
have another group of tho kind I am thinking of. The 



voiniirlcK I sluill liiiV(' to iniilco iilioiit, will, I )i 

Pfiirly ami iipply to a vtaw tni'i'H' I'tiiiil’o td’ n 

but for coiivoiiioiu'c^ of illnsl I'lilmm ! sliiill otniliiio 
only a small luinibor. 

Tlio ])arii(mlar ^’ronj) I iitn liikin^^ for tlisciissii 
ovoning is (iakon from l-Iio oiirroiUi (kaimtH, l-lio untiil 
miirricd wunnni in tiio (‘oiinty of York, on (Jonsiis tia^ 
grouped a(3C’or((ing (o iJhmV ag('s. In si'b'ol'tng a g’rt 
d]S(3ussion if luiisl- bo ljirg(5 onoiigli and snmli ('non, 
must bo suflicionl'ly largo l,o ctna'i'ai individual pooni 
or j)C!onl{arilii(.'s of small S(!t'iions ; il, iiiust Im siilliciiaiM 
fo 1)0 lioiiiogom'oiis. Moldi tlios() liinils siri' l•^'la,l.ivl'. v' 
lliat is largo oiiougli foi* ono piirposo is (oi) InrgH' for i 
purposoj and a group I lull, is iiouiogiuu'ons for oiu' is 
goiicous for anoilior. 'Tho douMi rain of a, wliolo ('mini 
bo sidlic'ionl, forciorlain ooinparisons, but, foi- o| lu'r conij 
you musf subdivido a(50oi'ding' fo disliriolrs and ago. 'I' 
tbaL has boon solootod miisf la' ki'pi in viow la'fo 
argitmonfs UjI'C) 1)asod on ilio grouping aaid its nioasuroi 
Tlioro aro fwo main divisions of groups; tliosi' t. 
doriv('d Irom oxaef olisoi'valrions, and llioso wlucli ] 
rogai'dotf as siuiijjlos ol a largor group Mio wbolo of wli 
no(j boon luoasnrod. Kor ]mrj)os(’s of rob'i'onco I n.ni 
Itiioin (ii'oiip rt, wlum Ibo olisorvations aro snppos(*( 
oorro(ifj as, for o.\!nu])lo, flio mnnbor of pi»rsous wln> 
rocoipl of a ooriain inoomo : and (iroiip /'i, win'i’i' Uio n 
aro osfimaijus; for oxaiiiph', a.ii osfimalt' of fho nun 
porsous wlio may bo o\p(‘o|,od to bo in roos'ipt- of 
inoomos Lou years lu'ma', fi'om an invosi ig’aiioii of som 
now, 01 ' aL .soiiio previous l,im(', As rogards (ironj) 
ehiof work will [)o (,o selool, some luoLliod (d' abbro 
of doseribing ill liric'l', oacjli group; in tbo case (d' (I 
our work will lie oliiolly io crilioiso Ibo corivolnoss 
sLatomonts, and Lo (bid niotliods wbioli ari' } 
applujablo for iLs oorroeLioii if if is noL (‘vaol., to imm 
prucision, and Hum alLorwards Lo soloed soino siiiLalilo 
of abbroviaLing IL. 

Thm {{uai’Iiu; MuTiion. 

J lio Lwo oliiof iJioLliods of ubliroviaLing or iiivi's 
Lbo cIiaracLorisLies of a grouj) aro Llio graphio inolJ 
tbo moLliod of avoragoH. Tlu' moLliod ,d’ a-vt'ranms 



perhaps ho reforrod to first ; hiitj siiico tlio nso of diagrams in 
explaining bho meaning of averages is very considorahlo, I 
have fchoiight it hotter to tako the method of diagrams first* 
I have drawn ont, in four different waySj tho group already 
named, tho nmnhor of married women in tho county of 
York. 


Ages of Wives -present with their Husbands in the Megistration 
Count g of Yorlc^ 1901 , 
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'L'horo are shown in Diagrams I to lY, tho iinmhors of married 
women in that county por thousand betAveen blioso ages. 'J'ho 
total of wives in tlio county of York living with their liiLshauds 
was 610,000 odd. As is usual, the numbers arc divided in years 
bo tween the ages of 15 to 21, and after that in five-yearly 
groups. The first method of representing figures by diagram 
is to place a dot in a given vertical position for each person 
or item in question. This is indicated in Diagram I. 'I'ho 
method is not very important and is perfectly obvious. I 
should only use it ns a means of passing to another, if it wero 

II 2 



I. — Point Diaquam. 



not that in those classes of measiiromcnt wlievo tho quantities arc 
separated by a finite iiitorval it is incorrect to uso tlio niotkocls 
shown in Diag-ranis ITj IIT and IV. If one was ontoring* 
tho number of liouses at particular rents in n towiij wlioro it 
might perhaps be supposed that the rent always junq)ed l>y as 
much as £2, one could represent proj^erly tho numbor of 
houses at each £2 mark, but there would bo no house at tho 
intermediate intervals, and it would be incorrect to proceed 
any further to such a curve as would load one to suppo.se Unit 
the quantity dealt with was continuous. To iEiko another 
example, tho railway service from one town to aiiothor might 
ho represented by a series of dots jdaced vertically over tho 
timo taken by the train, measured horizontally, hut not by tlm 
follo'wing methods. If, however, the quantity is capablo of 
continuous variation, such as age or height, or if hy a slight 
extension of the meaning it may he regarded as being capable 
of continuous variation, such as income, wo nmy procood tt> 
the method of Diagram II. 

In Diagram II rectangles are drawn whoso heights aro 
the same as tho height of corresponding lines of dots in 



II. — Ahea Diaciiiam. 



Diagram f, but tlio broadtli in the unit of aljscissQ 3 _, in Uii.s 
case livo years. M’Uo areas can bo rogarded as repre.sGnting a 
11111111301’ of porsons, ddio aroa of the wliolo .space enclosed 
by tlio outer lines oE the rectangles is on tlio scale choseiij 
4 square inolios, whicli represents tlio wliolo of tlio population 
considered; tho broadtli of eacli roctanglo'^ is ^ inch, and 
t inch squared represonts 1 per cent. 

Boforo we can go any furfclior wo liavo to make some 
assumption as to the distribution of persons within tho five- 
yearly intervals solocLod. Even in my class, (a.), when the 
observations are known to bo correct, some assumption innsb 
bo made as to distribution boforo proooeding further. If, for 
example, the correct sot of measuromonts oE the hoig-Jits of a 
rogimont was given, ovoiy soldier boing moasiirod correctly 
to tho nearest no correction would be required 

for actual mistakes, but before a continuous curve could bo 
drawn passing’ from one inch to tlio next, aomo assumption 
must bo iiiado as to distribution of hoigliis, c.g., that pro- 
gression was uniform between the given points. In tho case 
* Tlio (vvons fov ii^es below 25 avo sliowii iii ntiovo dot ail. 



of obsoi-valioim {(I) wliit'ii ily il im mU 

lU'COKWiry io tmikc' .sniiio tissiiin|)t inn ii'« lo rnnjhniii 
{ji’dor to (‘oiiHiih't’ wliai aaKiniijilinii ia jU‘n|M'r, in Mh' 
(jucstioii, naiioDilMM’ Mail. Mia I'arla I'XiicMy \ 

inntdan’H of laii'sons wlio.sa a^'i'M lit' lii'lwi'an ft'rlaiii 
Mini iM, W(i ai't' ^’ivaii l-lu' art'a td' lint r(M’taii!<'lt', or 
ilgiira wliitdi raplacaa iJit' na'Iiiiip’io on nacli iiiiil. < 
Wliai wa lmv{) Mi Hiippoao ia llial' Ilia ap’ca aro aiihtlivii 
iiioi'oly inU) yt'iu'H^ bill iiibi ittlinilaaiiiiiil iiiiira of lim 
wo liiiva io 111(1 lc(t Hoiiio aHHiini|)( itm ior ;• iii(liii|'- ns in 
fi’oiii Olio of Mio p'ivtiii posiiiotis lo (lit' iu*\t. 'i'lit'ro iiro 
poKiiiniiH wbicli ^'ivo tloliiiil.oly iJio luiiiibor oi' pt'aMnu 
20 bolow 20 ^'('iii’Hj aiid ao lorMi. \Vi' iiiivt' (o I 

iiuiiiiior of poi'soim bolow 22^ or any oilit'r lO’.si/pHsI n/p' 
is (I ([iiiio raiiiiliur idt'ii; biii (ibori' aro out' or i.wo Mi 
ooniiociioii wiih ii wliitdi li is iioot'ssary lo poiiil. onl,. 

Thm Mistoouam ANii riii: Ooivi:. 

If Hiriiip,’lii liiK's aro tirawii from ilio iiiiddin o 
borizonial lino in Diiigram II io ilii' iiiiddlo of Mm t 
goi Mio tloiiod lino in I )iiig’ruiii III (oiillod a liisl.i 


III,' lIlSTOmiAM. 




'I’hitl) \h (U'l’luiu liO 1 h) iiHiori'oct on two yToundH. In ilio finst 
aroH IxmiulcMl l)y tlio linoH inummi, ilio liig’liOKi poiiil. 
Ih iio<‘('HHiu’ily 1,00 Hinall, for pari, of ilio aroa botwtjun ilio aj^us 
lU) and db is ciil; olV l)y ilio doliod lino, and noiliing' is ])lacod 
insioad of ii. Bol'oro ])oinl,ing’ oui ilio oilior way in whicli 
ilio hisiogniiii is luioossavily imiorrooi, wo will ])uss on io 
DiiLgnini I V, wliioh is ilnis oonsiructod. Ai ilio agos shown 
on ilio Iioi'i/ionial axis aro drawn rooianglos pvo})orilonal io ilio 
nniiilMU' ol’ pi'vsons Indow iliai, ago ; ilion ivo goi a oontinnally 
nsoondin/^' liguro oallud an ogivo, which is given as absolnioly 
coiTOcl, in group a, ilio poinis ai ilio cornors of iho stops 
obliiinod in iho liguro binng given by assiinipiion, 'I'lio 
pi’obloin ooinos i,o bo io draw some lino or curve from ihoso 
(i\oil poinis iluii shall saiisfy iho conditions which wo nmsi 
assign. Now, ii would bo necessarily wrong io join these 
sncci'ssivo poinis by siruighi linos, if wo iiiko ihreo corners, 
A, M, ( 1 , noi in a, siraiglii lino, wo goi a sharp angle ai H. 
Inirodiicing slnirp a-ngU's ihoro necessarily involves an ('rror, 
for ilu\y indicalo disconiinuity al, corinin arbitrary jioinis, 
which can corrosjioml io no facts in nainro. The anglos 
oliiainod in iho liisiogram aro erroneous for similar ronsons, 
if, ns in group «, we aro io suppose the observations to bo 
c.orroci, a coniiiuious line must bo drawn through all tlio given 
]ioInts wbicli has no sharp angles in it, no sharp clnnigo 
of cniwainro, If, as in group /d, wo are not lioiiud to assume 
|,lmt the observations aro correct, the line may bo drawn not 
passing throiigli the poiid,s, but near thorn. Many groups 
may In' ri'prosented witli sulllciont accuracy in rengli work by 
drawing n. freelmtid curve imssiiig through the given poinis; 
il, will lie Ton ml tliiiX tlu'i'o is very little margin for drawing 
sueli a curve, if the rule is made tliat the curvature is never 
to be greater than neenssary, (hat tlio direction is not changed 
nmro rapidly than is necessary to jiass throngli the points, 
'rids condition, stated in niatlieniatieal language, supplies the 
main prolilom of interpolation. 

In group wo uro not boimd to asHunio that the curve 
puHscs til rough all the ])oln(,H, and tlio (piustioii which is tlie 
iiest mirvo (ilrawn freoliaiid or othorwiso) vaar the points, 
needs tlio theory of probability fur its dismission. 

iNTIJKimATION OdUVM. 

Ah ri'gards group a, to whieli discussion may be coiirmed 
for Iho iiroseiit. whui’u the curve is to pass throuah all the 




poiuif}, I Hiig’g'OKt l.lui ruiiiilianiiotlKKl <tf iiiiiii'|ntliLUini by 
bolic foriaula. 'I’aku (<lio (KjiiiiUoii | f(|.n I a-ji'"' I << 

COlltinUod Lo HH iniuiv l.td'itm m-o Mniiiriinwii.t fit fill 


Yesrs 



under discussion it would be inexpedient to take more than 

4 or 5 termsj because wo arc fitting a definite algebraic curve 
to irregular observation Sj and the law which underlies the 
observations may very woll chango if wo take a larger 
period than 25 years. I have confined the work for this group 
to 5 ierinS; continuing that sorlcs to aj*. 

Consider what conditions that curve satisfi.e.s. Stopping at 
the second term we have a straight liiiGj which can only be 
made to pass through two points. So wo have to start afresh 
at the second point, and thereby contradict the first assumption 
which wo make, that the increase is not subject to violent 
changes, introducing an angle at any point. Introducing the 
second term wo have a parabola, which can be made to pass 
through ihi’GO points; the curve has continuons curvature, the 
third differential, and the third differences obtained from the 
vain os of y at throo consecutive equidistant values of a*, 
vanish ; that is to say, thoro is no sudden change in curvature, 
n'ho first differential measures the inclination of the line ; tho 
second moasuros ilio change of inclination, and if that is 
constant thoro is a constant change of inclination, hut no 
sudden break. But wo have no reason for assuming a constant 
chango of inclination, and the curvo which passes through 
threo assigned, points will not in general pass through the next 
point. Wo then proceed to include further terms. If we take 
tho ocpiation u[) to we can introduce a point of inflection, 
which wo cannot do with the parabola. If we take tho 
oqimiion a stop further wo introduce two points of inflection, 
and it is unuccossary to go as far as the fifth term in a 
diagram like this. If wo take 0 terms, tho Otli differential and 
tho Otli difforonco vanish, tho hth difference is constant, and 
thoro is no suddoii break, and so on. 

Now take tho equation as far as the term oi*. We have 

5 unknowns, and can doierraine them by assigning the 
condition that tho curvo shall pass exactly through 6 points 
on tho diagram in question. I havo calculated the equation 
of tho cuiwe whicli will pass exactly through the 5 points of 
tho ogive, corresponding to 20, 25, 30, 35 and 40 j^eara. The 
method of calculation is a good deal facilitated by the use of 
finito differoucos. llofor as origin for ilie abscisses to age 20, 
take 5 years as unit, so that sr is 1 at ago 25, and write down 
tho equations which naturally arise, taking the numbers from 
tho last column of the table on p. 3. 
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I f<;i t ft, I u, 

I I I 
'107 — «(i I ni.O I (fj-'f I «!•■*' I 

Dryl^von I Oi.’l' ! II j. !■' t ui.'l' I (t,, I'. 

I(y iH onsily shown lliiir ’.li, </, A,! 0 

Oa = A,"r:-2“A;l-j-iJ -|' .1 ( of A/i, iiml (O ntnl «„ iii-o tinni 
ofikmlakKl, In iJiis cnsi' A,',- 'Ik, A^l <t'0,A,I Vl‘,rf, 
a,= -2;h|, aa::=i):5:';, a,--.lO',', 

IjoIj 1)o Mm (<(|itiiljon of IIm' 

lnsU)j>’niiii, tind //~K('i') Hut (tqinilinn nl' lln' nj-iso. 

Hy nutiiim ni tlirso oipiitl imi 

pariil)oli(J or sniooMiod ourvi' imw oliliiiiioit rroin Diiij'n 
(ifUi 1)0 iisotl 1,0 runiisli viiliK's |,o n'|iliu'o ihn 
Diit-gi'iun III. Tint simio unil, lur .n (livo ytmi'u) nil 
cour.so; 1)0 list'd in l)o(,]i ciihi's. TImim, i(’ .»• | (Of, , 

I- On,. T' I |,f,. I • |:{;,.o. 


yi= z,dii', niul 
. n Oil! 


At. 00 yoiiVH, r;- KKI-i) ; n|, Ofi yoni'M, loHnS. 

Tho onrvo oliOiiiutd in |.|iin wiiy in nlinwn tn coiili 

lino in Diiij^'raiii III; iliis curvi' snt inlit's ndilimi 

ilio tivoiiH stiindiiif*- on (ho fi-ynni' lmt.i-.M, IVum Oo y<.| 
45 yoiU'H, slioiihl ntjin'si'iil- on llu' clinnon m'hIo (ho luini 
porsons ^dvon by |,]io oi'ip'iiuil liilih', iind Hint I horn uhoi 
no al)i'iip(, (dian^'os of (nirviil.nrt', 

Sin(30 Hio oiivvo Inis hi'i'ii rlioson fio n*, (u mil inf 
conditions for only llvo 11^0 |H‘riods, i( \m|| iiu|. iktoi 
H atisly any nioi-o; bnl, in Hiis ouso Hio onivo inor/:oi 
a^siniif^Hil lino, wliioli approxinmtoly riillihi iho oonilil.'io 
55 yoavs. tf wo nood f^ri'alor airuviu-y in hi, (or ^yoiii 
bhonid oalcnlaio now valnos I’or I, ho u’h ' and ohlnii/n n 
oiivvo, saikl'yinf,^ a now group id' nroa coiitliHoiiu. 
iiQodod to draw Hio wliolo ourvo aooiirulidy, wit nlioiihl 
to dovisu a nioHiod of imssing witlioiil, a liri'iih of oojili 
from ono such ])aral)olio omwo to (,!nt ni'xl, ; hut, an l(, 
only want moans of oUaining spi'oiliod poinln on Hio ( 
and that can ho dono hy choosing Hio HporinI paraholio 
that IS m tho noighboiirhood of tho roimirod noinls. 



AvKKAUI'JH. 

Tiii'i 11i(' jxiiiil. ol' Uio hiiiooI.1i nirvo in 

Diiij^’riuii 1 1 1, ; Ikmioo ”0 in Uio oj^'ivo I'ov ilio ssiino 

viiliio oi’ ii!. TImihj iih in olliovwiHo ovidiniL, Uio ogivo in 
h1,oo|){’h1. oihI Mioro 1h ii. piiuil. of iiiiloxion, iil, Uuii vnliu) of <r- 
wliii'li llio Dnliiiiifo in Diii^’riini IJI 

If =::;() WO lul.vo 0 = 2lt., *1“ "|- l‘2ai.i}“j himI 

r/.ir 


wlioi’o o.j, 0;i, f(i nro f^'ivoii in (.oniiH of l.lu) difToroiHUis abovo. 
VVriliiif^' ill l.liOHo viiIih'Hj wo liavo — wliicli oorronpoiulH 

1,0 :U)* 10 yoiivH. 

If wi' liad iuolmlod a furUior l.oniij wo Klionld liava 
II (Millie 1.0 Molvo 1.0 dol.i'Viiiiiio 111. If wo liad only f^’ono us 
fai* iiH 0;i.r'', wo hIiouIiI liitvi* Mio o(j nation () = r(.j'l’ flint 
in, I : 1ml. tliin I'oi'iiuila is miHafoj nnloHs ilio 

fmii’l.li dilVoroiKM'H of tlio ovij^'ina) i’if>'in'OH nv(' a]i]n’oxiimitoly 

'/.(M’O. 

M'lu' o([iuition l''ii.Icoii iik far im tlio itd iorni apiioavK to mo 
1,0 1)0 pnielieully tlio IiohI. in tlio oxamiilo wo avo disoiiHHing. 
If wo (‘hooHo l.Iio enollioionl'K to Katisfy t.lio oonditioim Htavtinj^ 
from tho a,f!jo 25, wo obtain llO'd.Il yoai’H uh tlio iionition of 
tlio liiKliimt point. Tlio diHOi'oininoy botwoon tliin and tlio 

:{()‘10 yoiu'H fnnnil from tlio piirabola Htai’tinf>' IVoiii tlio ago 

20 yolii’H, ariHi'M fnnii tlio iiidotonninatonosH of tlio original 
ligMiroM. It HooniH liowt to talco tlio valno from tho fovnior 
ourvo, lui I'lio poinl. tlion lion noar tho niiddlo ol tho anHignod 
valitOH. Wo adopt tlnm tlio ago MO' 10 yoai'H iih tho voifuirod 
ago, and lind that y- lOO'Od at that ago. Tho ago ho found 
in'eallod tho mode of tho grnii]); it in aim) callod tho portion 
of <1 nut hi fit f/miN/Vy and of tho mmv/nnvm ordiiKih'. 

Tni’i MnniAN. 'Tho abHoiaHa of tho point (M) whoro 
tho ogivo Ih oat by tho hoi'i’/'.oiital lino liall way ii)) tho 

Hoalo (from 0 to 1,000) ia oallod tho mMia'iu In tho 

liintograin, or tho miiootliod (Mivvo whioh roplaoea it, tho 
voi’tioal through tho niodian dlvidoH tho ourvo into oipial 
aruiiH. Whon tho ogivo Ih drawn, tho modiiin oaii at oiuio 
bo found graphioally. To Ilnd it idgobraioally, tako^ a 
pariiibolio (Kjiiation as lioforo, HaiiHliod by hvo pointa lying 
l.n<k nti ((btn.iii tho oiudhoioiitH UK lioloro, and put. 


,y = nO(), Olio or Ml(' I'ools nf (hr 

nuHlmii. wr huvr 

aiHl. HolviiiK ]).v llonK'r’s iiirlli<i<l, s.. tin., 

inetliim ngo ik Md I (yriiivi). ^ 

Tlio tiuarlilrf^ ni'r Mk. nh.^u'ihMir rl (hr [Himld ((,), 
whom Uio liori/.ontiil liii.vMnio-iitnii-Iri' nmi lhivrH|iiiiH« 

I, ho HOlilo (IVinil (1 In l,<ldd) 0 ..( (hr ug’iNr, 'I'l.r v. 
Uiroiigli (.Iioso in Diiig'i'iin. Ill wnild, l.n;' 

with rho iiio.liiui voi'linil, dividr (In' iinM iiitn Imn- 

piir(.s. 'riiu (imnliloH run ho I tVnin unr Oit' r<[in 

almmly w.-il-ion h,y ]/ ::h(» mid Vdo Miirr.-w 

mid Kolving I'nr ; nr Ihr^' nm hr I’nmid |»hiridl y. 

A I'riigli innihod r)' liiidini'; IJioho pniid’i, nl (on itiill.ri 
HC< 3 in'nto, itnd Kii-viiig’ n< iiinro hihin'iniDi M.lidlnn, in l<. ii^ 
ihni iho j.fii'I'H ol' i.ho ng'iv.t hri.wi'rii llir rnnirr,*. \ 
coni, mil tho iiiodiiin ni’O nii'iiir;hl. liiu".. ’I'lirro ni-r lUV 
UiriiMniid) bolrw dh yriii'H; dd mil. nl (hr d.i (t> 10 
g’roiqij -vvlijoh ornl.iiiiiH I '17, iii’o l.n hr luKoii In rr.irh 
iiiodiiiii} which in nii (.Ik* hypnt hoii'i n) it 'ilrmg'hl 
of h) y.'itrs, lilnil U d.S'lV ynii'*., n \iiliirdilli 
liUdo [rrm Lliitl) iih'oiidy nhininrd. 'Plir Inurr )|iiitrlil 
oil^hor moil.nd i.s ;iO‘ld yriii'.s, Iho ii|ijir)' IS’li \riii-'.. 

V7o liavo MOW (.hr I'nllnwiii/*' llgiirr^i : 


liowor ipuii'l'ili' 

MO- It; 

yt'nn 

Mndo 

MO'IO 

M 

Modian 

M8‘l 1 

M 

Ai'itlmiol'i.' avoriigr , 

•IIMII 

1 » 

U])por (piartilo 

dS'O 

1 1 


TJio ariilnmslio, avorng.' in oiilciihitril dii'ordy in 
ordinary wiiyj bnti ia of liMilo iinpnrliitiou in nindi a p; 
as Lhia. 

If a ])(}rHcni ia tnhon ai/ I’lmilnm IVnni Ihiii /fmnp, 
inoai probabUi ago ia MO' I yriirHj (.In' ninth'. If in loi likr) 
not i-liaij ahr will ho nvrr MH’I I yriii'aj llm innilimi. I( j 
likuly aa iirl. |jha(y alio will ho hnl.wot'it MO'IO ninl [M'll yi 
Tho ohanooa aro fM in I agaitiali hnr hriii/'; Ira-i l.haii M 
yoar,s j M to ,1 againat hor hoing ovrr 48*0 yriiivi. 

Oihor pointa oaii ho olit-uiinal hy tlivitliipr' thr group 
ton ocpnil partHjOr oiio liiindrod otpiid parl.a. 'I'hran ni'o ra 
tlio decilct^ and pm'miilt'N roaiuadrivttlv. 



mode in bucIi groups as this seems to be of 
special importance, as being the most probable value, lb 
is entirely unaffeotecl by the extremes. If the Census 
authorities had omitted all married women over 50 or all 
under 20 years in their enumeration the mode would be still 
in the .samo place. That is very iinportaub wlien we arc 
dealing with inaccurate figures. In ilio.so curves wliich have 
a distinct mode, where the curve fir.st tends upwards, reaches 
a height, and then comes down again without ever pausing or 
returning to a second height, and where there is a certain 
symmetry or similarity of distribution on either side of it, in 
such curves the mode is of special importance. If, on the 
other hand, you have a regular mountain range represented 
by your curve, the mode is probably of much less importance. 
If you have a single peak it is probably of importance. Bnt 
though it is important in itself it is quite insufficient to 
dosoribo the curve ; it only tells you the position of one point ; 
it docs not tel] you the steepnoss on either side, or the distance 
from tliero to any assigned point. 

'J’ho mecluon is affected hy extreme.s to somo extent. If 
tlio authorities had omitted all the married womou over 
50 tlio median would of course havo been shifted, but not 
very much, lor tho area, which would liave lioeu left out 
at the oxtrome rig'ht, when halved and distributed in the 
neighbourhood of tho median would bo found to have 
cau.sod only a very slight displacomont of it. That can 
bo verified from Diagram IV. I’o take an example which 
can bo supplied by tho diagram, suppose you emit all 
tlio.so boy end the 800 per-cent, which gives those above 55, 
thou the line through the 400 would give the median, wliich a 
very rougli measuromont gives as 33 years. That is to say, 
tho median has only boon shifted five years by leaving' out 
that ill! men so number. If, instead of omitting these people 
over 55, tho Census authorities had simply said, “ Hero is a 
married wonuin, obviously old, wo do not know her ago,” and 
had outored her in that category, it would not have affected 
tho median in the very loast. IIio position of tho extremes 
does not affect tho TOcdian, only tho number of instances, In 
the statistics with which I personally havo to deal, often all 
that is known is this number. In this respect tho median 
is very superior to the arithmoiical avorago. Tho same applies 
to quartilos. If wo do not know the oxaot ^^osUions of the 



iiiK(!in(*(‘S In) 1 1)0 or io tlio h'l't ol I ho i|ii{irl i1i«n i|. <|o 

iiinttoi*, ])i‘ovi(lo(| \V(’ know (In* iiiiinhrr'. il \ on tlooiil 
(jiinrtili'.s luul l.lio iin'diaDj yon Iniso thi'oo |inii)l‘> on llm 
oiirv(‘, rliroo po.silioiiH in IIk' lii'<(o|>-riini, lrot)t which llm 
('an orion ho coiisiriicl.t'il wilJi I'iiir iiccurnc\, 'I'lioni'ilti 
iivoruf^o, or Hi)n))ly ‘Min' nv('nio'o,” j;i\ o'* tlio ntaicifi'ui < 
t!onlv{! ol’ ^’I’lwily ol' l-lii' {sroiiji \vli('U jiluHt'd out 
DiagTiini III. Tlin nril liini'l ic iivi'ra}\i‘ Incililnloii o 
oomjniliitioim^ hiitj in my ('\|H'i'ii'n('(', il i‘. Iht' loioil viil 
ol: Uio moiiiis or tivoraj»’oH which can In' cnlcnhilod ; 
pooploks oxporitMioo nniy lai dilVi'i'cni. 1 1 r. voi'^\ lial 
error, IT a pai'l of Mn^ proiip ia nccidcnl)i tly oinillo 
uvoi'iifi^o is iiL 011(10 an'ooloil, 1 1' Hit' nuinlicn. iwr coi i-t*t! 
tliu jjosiilons ]io(; rc'j’y I’ai’ oal', yon would llinl liy o\pci’ 
tliibk Uio a.rlMiniol)i(‘ iiiVora^pi has not' ino\cd tiiindi ; Inil di 
any iiiinila'rs aro h'lli oiil, Mni iii'ilhim'Hc n\'ornj>o is diiili 
IlnL ili(' roiisnii I disi.ra.sl) llio arilhnn'lic ;i\crii'p' and d 
(ulvocato its mso i.s, chiolly liocaimi' il rciidon sncIi I'alli 


iirf>’innoiiLs [los.sihh'. il' yon am coniparitiij oio' pj'onp 
anoliliorj allor a lilrilo inli'rval I In' ai’il liinot i*' n\ci'itjp) 
havo roimiinod ipiiio sloady when I In- pponp him clit 
oonsidorahly^ both Iho nxh'mncs having;' coino in townr< 
moan; oi' iL may sliilli when I In' p'ronp him nol rcnlly chi 
its oliaraotor, lm(, only sliil’lod i(,s posilion a little, 
particjuhii' ohanj^’o ol Um ai’ilhnnaic a\onip;c ninv coi'ina 
to an inliiiiLo iuind)or of diH'orniit. kimhi ol dunipo n 
groupj and ilm.s very ol'Loti poinicd on(. lhal u ct’rlaln } 
inis ofningod, iJiak soincthinf,r has iiupi’ovcd In'ciiimt' 
ariUiiiiotio avora-go lias changed ; whereas il Is only nil 
tho rohitivu posiUoiiK of (,\vo g’ronps which are no|, conn 
in roaliiy.’l |J wo havo a p(<rl'(’c(ly honiog;eiieoiiM p;roir 
insianoo, il with wage statistics, w<' di<al with ii net ol' 
doing' siinihir work and earning Himihir wages,}! eluinge i 
antlniu'tio average is signilioanl ; Imt if we are dialling* w 
composite group ooniposed of skilled and nn-ikilled work 
two ilomogoneoiiH gi'oujis merged into one, (ho nril In 
avoragomighL increase (ntlii'v hy the higher g;roi.|. ,mrei 
a httlo wlnlo the lower group went down iicarlv let far, o 
0 lor way about; or by a, eomlnmitnni of those two (h 


of ‘i'" 'ciuinl 

aviL 1 a ^ of fu 

lualimoLic ivvcnifro m mhoI very cini'li'HHly. 



So tlio aritlnnetio avoi’Eige can never give definite informatioiij 
and very often gives fallacious information. 1 have not time, 
and perhaps it is not necessary, to dwell upon this point, and 
refer to the correction factors for Urban death rates. The 
necessity of that method illustrates iny meaning in saying that 
before an arithmetic average is used, it is necessary to make 
sure that the group is homogeneous. 

Tlio quar tiles and the median not only give the definite 
position of iho median, but also a measurement, ■which serves 
to slio-w how the curve is dispersed from its central position. 
The distanco between the two quar tiles, 1 8'4 years in this case, 
shows to some extent how the curve is dispersed from its 
central point. That I shall return to in giving’ other 
measurements of this dispersion, 

If wo were dealing with a group that did not give any 
such regular figure as this, a group to which the mode was 
certainly quite applicable, it would probably then bo best not 
to attonqit to draw any continnons curve at all, but to keep to 
such a diagram as that on page 5, and to calculate the deciles 
us accurately as possible. By making some simple assumptions 
as to continuity, it would bo possible to calculate roughly the 
nine docilos, dividing the area into 10 e(pial parts, and enter 
thorn as a description of the group. I think that is the only 
method of satisfactorily representing an irregular group which 
cannot bo divided into distinct homogeneous groups. 

CoMi’Ainson or GIkoups. 

'I’lio ogivo diagram lends itself more readily than anj^ other 
to the comparison of the two groups. I have selected two 
groups, which oiio might wish to compare, from the same 
Oonsus table, the husbands whose wives wore between 
45 and 50 years of ago, and the wives whose husbands were 
between 40 and 45, which are represented by the lines 
LL and KK respectively; and I have calculated, by one 
motliod or the other, tlio mode, the median and the quar tile of 
those groups. Thus, for instance, from the curve IC, of all the 
wives whoso husbands wore between 45 and 50 years of age, 
as many wore less than 45 '5 years as wore more than that; and 
similarly for the quartiles. The curves are very similar, the 
husband curve being four years to the right of the other. 
'JTio motliod needs no further comment. 





lrj;USTRATroN OK OP Tnp 3•^^:DrA^’ 

T may take one cxainplo to illustrate the use of tlio 
niochaii. The cliagram on p. 18 re])resents tlio weekly wages, 
valuing ovory thing that is paid in goods anti not in money 
at an appropriate rate, of throe classes of labourers in 
i'higland, nainoly. Artisans in Provincial Towns, such as 
Jliriningliam, Agricultural Labourers — the average for the 
whole of I^uglaiid — and Labourers iu the same towns from 
which the Artisans woro selected. I’lio figures are rather rough, 
and iliorc is no matorial for inuking them exact j hut I think 
the lines drawn roprosont with fair accuracy the course of 
wages ; for if wo once established the fact that all agricultural 
labourers aro below tho median, wo have simply to count them 
and not uuqiiiro about their wages. And so if wc establish 
tho fact tliai any body of men is well above or well below the 
inodiaii, wo have not to enquire into their wages, but simply 
to count them ; and to find the median we have only to 
investigate more car of idly the body of men whose wages are 
near the median j that is a comparative!}’^ easy task, because 
tlio body of men who aro near to it arc those whom we see 
any day in any ordinary industrial undertaking. The 
Gonsns figures are bad for this purpose in 1902, and 
tlioy were much worse in 1801 j and there is a great 
deal of compiiiatioii and guess-work in determining the 
position of the median at any time through tlie century. 
Put it can be done within certain limits of accuracy where 
tho task of dotermiiiing tho arithmetic average would 
bo hopeless. When wo have determined tho median and trace 
out tlio positions for 110 years wo have a much more 
interesting and exact piece of information than if we had 
made use of the arithmetic average. We have tlio wage of 
that man who is half way up tho skilled wage earners j but if 
we givo the arithmetic average it will carry us no^ further ; 
it is simply a iiuinerical cpiobLont. The lino in the diagiam is 
drawn through tho oatimated positions of the median for all 
male adult wag*o earners in the United Kingdom, at selected 
dates. Those figures are rongdi, and should not bo quoted 
without verification. The only ones calculated are those with 
a dot or cross in the figure ; intermediate lines aro iiitorpolated. 

0 
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MlUSlIRMMl'lN'l' OK (iROUPS. 


siccoNi) i.i'xri'uiiic, 


'I'liK tS'l’ANDAlM) DpVlA'I'iON AND TIM'; i\l0|H[IA)S. 

ini'] mkiI.IuhIh I liiivD (uiiployi'd ho i’liv I’or doLorminiii}^' ilio 
inodiiui n-iid lilio iiiodo, li(t^'olJioi' witli Ui(} oi'diniiry hioUukI o£ 
(l(d-oi’n lining' Uio iii’iMimoMc, l,o*(olrl)or hIho wlUi ilio 

11, ml dooiloH, f^’ivo iii HorioH of doilnito ([iiiiiiljiLioK 
coniiodl-od wilJi Idui imu’vo. I'laoli of Mu'ho ijuaiilAljio.s — Uio 
niodo and Min iiii'diaii -porroi'niH Mio rnnotion of an avoragoj 
Mial; Ih 1,0 Hay, MiiiIf mi Milan' liy iljHidl' ^‘ivon liriolly ono oC Mio 
iiiohK iiii]ior(raiilp {lOHiliioiiH, oiio ol’ Mio iiiosl iiuporiaiit 
olm.iMU‘Un'iHl,i(!H of Mio wlioln oiirvo. Muf no ono of Miuhu 
( jiniiiMMoH *v'V0M Hiill'mionli in formal, ion io unalilo uh io 
ronoiiHl-iMudi Min oui'vo or In doHorilio if ooniplotoly. H in 
I, run Miii,l. If wo iiavn pi'ivnn l.lin iiinn d(‘oiloH, iiududiiif’' Mio 
inodiaiij wo Iiavo iiino |ioin(,H on a, oonfininaiH oiirvo, ami in 
g'niioral it in |)oHHil)ln io onnHinioi ii with ruimonaldo aoonracy. 
Bui if wo only liiiwn ilin iiiodoj or only Mio inodianj wo 
liavo noi (>mni}.>'li io ooiiHiriioi ilio oiu'vo. My objooij Mion, 
iH in <lnvolo|) ono or iiioro inoiliodH id' oaloulaiin^- oMior 
{[uaniiiioH rnlaind io ilio ^>'t'oiip, wliioli will nualilo iiH to 
ooinploin or iininml ilio doHoription of Mio f>'roup, hh g’ivon 
Hjinply by ono of ilio avnra|‘*' 0 H, 

Wn will alwayM HUppono ill at Min ;^Toup \h doHoriliod in 
rnlaiion io a liori/oniiM iixIh OXj and may lui id’ any nainvo 
a, bout ilio ax in. W liai wn Iiavo found ho far in ilio modiaii or ilio 
iiiodn in ono point on tliat {^roiip, ono pimition on tliai axis in 
ilio oiiHO of ilio imaln l,lin jiimition imdor tlio liip;']iOHi ])oini — 
in iho oaHo of ilio inodian ilio poniiioiij iho lino Miroiif^li wliioli 
dividoK ilio onrvo into two (upnil aroiiH — in ilio oano of tlio 
avorapjo it in Mio iibHoinHa of ilio ooniro oJ g'raviipy, I bavo 
now io liml a Hooond (pianiiiy wliiidi will onablo us io iloHuvibo 
or doiormino tbo sliaim of iho o<urvo wlioii you arc given iliis 
ono position on it. Tlio nioMiod I am going' io tako is 
imlnoitmloiiii iif iLHHiiinod Hliano id! Mm ciii’VO. and it is 



nppli()U.1>l(5 to l)()l]i ‘.'I’oiips In wliich I mi 

tlio group whioliiH Id iio ini ju’ciii'iild rt'prt'MOdljil.i 

of ihu I’ucIh, iukI iiiat wliicti n'liri'sciilM only Mimplt-s oj' 
largor group wlioso olisi'rviition ih ni)t (’iniipldtol y iiiiiili'. 
liiivo (IrHli to (!('si'i'i!)o tlu' wi'lt-kiidwii nu'llmt! nl cjilctiliill) 
tluMlovifitioiiH i'roiii lluHm'riigi', hihI IIm'ii (n pn'in nii j.o ||| 
ilio avorugo dovinlioig tlio avi'r/igi' fi(|ii!iro ol I hr ilrviiilio 
and iliu avorago (iuhoof (Icvliilimi. hi'l lltt'rr hr n titiHorval.M) 
rupi'OKoniod liy ti;,, ,i'., . . . ir„ ; Irli ,r ho llic nhacisMi of |.| 
ooiitro of gravity ; thon m is tlm HVdnig’o nl' I ho group, tlir sii 
of the) iij's dividod l)y n, Mioir iiuiiila'i', l•’rmM rnt'li of thr it 
subtract tlio abscissie of ibo otuilrt' of ;'i’ii\i(y; (hmi ir, 
ii'a iDj • > • ii'o ’itj , []ioso n I'o I Ih' d V III tidi iti ol (ini obsi'rvi 
tioiivS (I'oni tlioir UiVoriigo, In soiiio cdiims'l itniM I hoy itro rnlh 
the orroi’H I'roiii tlu' avoi’iigo, Inil I shnil tidopl (.ini wm 
‘Mtiviation in ovory oiisd. In (ho lii'.,! plm-|. it, in |o | 
iiotiuod tiiat tlui suiti ol l-Iio doviiititnis in inM'(*:t,“<iiril v /.ont j ft 

. 0 . 

^ Thu Hiini ol' tho H((uaroH of tin' doviutimni is ,in 

is t])o nuiiiii s(|nnrn of tlio (h'vlntitnri, which 

othorvviso oallod tlio sooond niininMit of (he di'viiil ions, ahm, 
tho origin in this caso. The word inmnent is IV, .ni 
dynamical analogy; il, is used in I his roiiinicl ion by l‘r()r(>snti 
ICarl Poarson, 

Tho following notation is iidnpl,!<l. 'I’lio inmnonl 
moasurod about tho oidgin an* writimi /t,', /// , . ,, about tli 
contro of gravity /i,, . . ,, so that 


/X,= V (*-«.)= 4, r. ■,«, .0, 

01 )l ' 

/»,= V(,..-S)«= v„,»_ V , 

Thu qiiantitiUH wu need aro not tlio inuinonlH about ii 
arbitrary origin, but tlio inonionts about (lio contro of grn-vit 
But It IS far oasior to ualciilatu tlio inoinnntN ulamt il 








n, will now 1)0 convoiiionl. to I'ullow iJio on l.ho 

|)il)]o iiiul tUngniin ndjoinitiM'. Tli(> (ij-'uiTu inv hiKon iVoiii llio 
vopoi’t of tlio Aniliroponu'li'iciil ( loioiiiit I ni' of lii(> lli-itisli 
AsHoc'iiition, in IHHI. Tln'.v mv M'loclod nioivl^ ns lioiti^- u 

coDVonioiit gi'oii]) l)y wldcli to oxphiin of tlicwi 

iiioiuontH. TIu) tioi^'Id.H of I dido ixn'.sonn worn p'lvcni iih l)ntwo««n 
certain inalioHj botwotni 57 iind 5H iindion, iia innlor I ho oolmnn 
lioadod .«'l-57i, I tiilco tho onp:iii iil 57 1 iiioliox, luid tho 
abscisMiu for tlio HU(!(! 0 HHiv(' groupH nro I, Li, o(), 'pj,,, 

nmnbor of in.stnncoH in tlu'sn variouK /j^roiipa tiro tlioao ^fivoii 
in tho aocoiid (iolmiin, niuh'r tlio lollfO'//: oiio poraon iiiMlnr 
58 incliOH, ono liotwoon 58 and 5i) inolion, and no »in, Tlio 
iuRtfincoH in tliia omho ocanir in ^n'oiipH, and wo aro not atilo to 
soparato thorn liy nnuuiH of tlio data, lioiioi' oindi doviatioii will 
occur in moat ca.si‘s morn than onri'. 'i'hna, a tlovialion hIiowii 
botwooii Oh and (15 inolioa o(anirH Ihdtinios, Inal end of adding* 
tlio ftAv niinply to obtain tlio ih'viiition wo mull i| dy I'noli 
deviation by tlio nunibor //, tho mimbi'r of tinioH it oociii'm, and 
HO obtain tho third colninn ai//, whoao Hiun in wliicdi is 

tho first inoinont alamt tlio orif^iii. Tho attin of I ho dovintions 
is to bo divided by 1,9.85, tho tohvl niiinlM'r id' di'viatioiiH, to 
j^ivo tho first niomont, nanioly, lOdttli, and thin p-ivi's tho 
position of tho oontro of f^’ravity nioasiirod fismi tho (iri|';ln, 
57^ inchoH. Tlio cohinuis midi'r .nf and iry'* ni(|niro no 
explanation. Tho totals 207,987 ami 2,8 tM.'tliV aro divided 
by ?i, giving t07‘5and 1,218, /t'.j and /<.'.( in tho nolntion iidojitoil. 
Tt now remains to roduco those nionn'iits aJiont the origin to 
the momonts aliont the centre of gravity, by nn'iniM of tin' 
forinnhu given abov('. 

Tho practical simplicity of evaluating the moinoiit;i by this 
method arises fi'em tho fact that wi' aro <loaliiig‘ in tho .rV 
with a series of nmnbors a,scending in nniform ordm’, I to 20, 
and. that the whole iiritlimetio oom|mtation is very Himpio ami 
very easily checked, whereas if we jiroci'od on tho direct 
method of writing down tlio position of the omitro of’ gravity, 
which will naturally not ho an exact numhor, oacli id’ the 
deviations will introduce as many docimiil plaoes ns are kept 
in our calcnlation j and tlio sipiariiig ami mibing will bo 
very arduous, and wo liiivo no ready iiioiuis of cliocking mir 
results. Lt is tlioreforo worth while to take the rorniiila ami 
choose our origin so as to give the hnist arithinotio work M.nil 
obtain the second and third monioiits imllri'oldy. 'Plioro is a 



sum 1 1 ooiTOol.ioii io l)o iiiiulo lor iilio inonumLH so calculniod for 
ilio Hooond, I'oiirl.li, luul oUkm’ ovoii immumls. I will deal only 
wiUi ilio hocoihI. It is lo 1)0 olisorvod tluii in iho svholo 
O)il(iidtil.ion il' iw iissmnod Uud/ all 1.1 lo porsons in ii purtionlar 
^I'ouj) aro ('xiiotly iiL l-lto iniddio of Unit t'TOiip, r.r/., bliat 
If)!! porsons in l-lio (I't to (15 Inohos liavo tlio lioiglit exactly 
(jd'i inidioH. Ill is oljvions that that will not ho the case, and 
it is easily soon tliat that will intrtidnce a dolinito error in tho 
cn-loulatioii o!' the scicond in onion t. h\>r if wo take ono of 
those groups in ])artloular, and niako Uu) assuinpbion that tho 
whole niinibor lies ai its midi lie ])oinb, wo aro reprosonting it 
hy a rootanglo insti'ud of hy a trajiozinin with Uio side noaror 
tho oontro of tho group longer than tho other; n littlo 
consideration will show that that inakos tho socond moniont 
too groali. Mr. Slioppa.rd has shown tliat under cortain 
ci roll install cos it will ho sidlioiont corrootion to subtract tho 
fraction i’roni tlio socond i n onion t caloulatod on the 
asMiiniptiou of imirorin distrihntion at tho iniddlo points of 
tho gron])H to oh tain a inoinont in a true approximation. 
On lingo 21 tho cor rooted in omen tj wu, is whilo tho 

nncoiToctod inoinont, /ay, is (Mtli?. d’ho corrootion will ho 

only, if tho dilforoiico hotwooii suocossivo gi’onps is ono 
unit of ahsoissa; if the dill'oronco was h, wo should have 
to iiiulti[)ly /ij by /i/-*; but for ])ra(itical work it is host to take 
tho unit as the distance bi'twomi groups wliich wo aro dealing 
witli, and lionco tho corrootion is in tho form which is 
of practical use. 

'I'ho “ standard doviation ” is dolinod as tho sipiaro root of 
tho socond niomont about tho cmitro of gravity. Professor 
Karl Pearson used cr to donoio it, and <t is 2*502 iiichos in this 
enso. It is HomotiincH more convoiuoiit to deal with tho sipniro 
root of twice the moiinmt, which is called tlio modnlns, and 
donotod hy tho letter r. Professor J'ldgoworth nsos the modnlns, 
wlioroas Professor ICarl Pearson nsos tho deviation. Wo shall 
son the a])])rnpriiiXonoHs of tho modulus when wo deal with tho 
curve of error. The modnlns for this group is 11*028 inclios. 
It is a very romarkahlo fact that tho modulus for tho height 
of groups of men is almost uiuvorHally very nearly 8*0 inclios. 
Professor Mdgworth gives ,a list of 10 such groups in the 
duhileo volume of tho JoudiuI oJ llui Roiful HUUmtical hiocioly : 
the moduli aro 8*0 (United Kingdom), 8*0 (lOiighuul), 8*4. 
(Scotland), 8*0, 8*7, 8*8 (Uni toil States), 8*7 (Belgium), 



3'7 (Italy). I merely call attention to that in pn.ssing*, i.o 
give an idea that the modulii.s is of real significiinco inul 
not a mere arithmetical calculation. 

Aveuage Deviation. 

Dor the next few minutes I propose to nssunio that tho 
curve I am dealing with is symmetrical about its contre of 
gravity. The curve of heights which i.s sketched on page 2 1 
is in fact very nearly symmetrical. If the curvo is actual ly 
symmetrical all the odd moments are easily soon to bo /.oro, 
while the even moments aro not. Then this quantity o- or o, 
whichever we adopts servos to moasuro tho distanco of tho 
curve from its average^ to use a clumsy phrasOj or tlio 
dispersion about the average. Before discussing iho 
appropriateness of this measurement I havo to explain two 
simpler methods of measuring the same thing. Ono basod on 
the first power of the deviationSj and tho other basod on tho 
distance between the quartiles. First for tho avorago or nioiLii 
deviation which, in the notation I am using, is callod o). 
If we Avrite doAvii the deviations in the method just dofinod 
and add them up, Ave obtain zeroj but if wo treat all tbo 
deviations as po.sitive and add up tUoir absolute values \vo 
do not obtain zero. The calculation is as folloAvs : — Treat the 
negative deviations and the positive deviations soparatoly. 
The .sum of tlie negative deviations is 

22/(10-04.19 -n)) = 10'04d9x 824- 0331, 
from tho figures to the left in tho table on pago 21. 'Tho 
sum of the positive deviations is 

2?/(ct~10'0419) = 13100~10'0419x111I, 

from tlic numbor.s in tho rigiit compartment. Niio avGrag‘{' 
deviation i.s, therefore, 

ll:j 100 -10-041 9(11 11 -824)-G331}-r-1935==2-01 (iuchos). 

PROUABLR ErKOE. 

The other simple method is based on the qnavtiloH. 
Calculate tlie quartiles of this group by any of tho motliods 
already given, and you Avill find them to bo ajajDroximatuly 
65-8 inches and 69-3 inches. Since the median as given on 
page 21 is 67'566, one quartile is 1'78 inches below tlio 
median, and the other 1-72 inches above it. Half the distanco 
hetAveen the quartiles is callod the probable error. It is ii 
term which is .so firmly in use that there is no liopo'oP 



improving' !(■, Iml. il' Ih oiu' oT Mk' inosii (M’vonoouH i('nns Id hho 
ill iiml.lH'iimiii'S. IliiH’ l.lHuliHl.inu’d is I'Vr) iiit'lii's. H’wi* taki' 
ti. person III- niiMloiii iVoiii l,liis i;Toiip mid iiu'iisuro liis lieighl., 
it- Is ns liivoly ns iiol- l-lio In'i^'lil. will bo 1‘oinid l,o be bc'i-Avc'oii 

i, lu* (pnirl lies, I’of l-lu' spneo <'<iiil.niiie(l lu'l-weoii llio onlinsiU's 
ill- l-lii' i|iin rl.iles is oxiiel-ly linlT l.lie whole e.iirvi'j heiieo t-lie 
pliniso “ pi'obtiblo on’ov,” 

If wo w('ro donliiij*' wil-li l-lio spee-iiil distnbiil-ioii dolennined 
by l-iu' o(|iiJil.iou lo l-lio eiii'vo of ('rroi' (see p. b-l-), we should 
Imvo Mh' followin'^' I’ohil.iinis : iivimm.j^’o (h'viiil.ion — iiiodnliis 
~i~ iind ]>i'obiibio orror— itiodiiliis x •'l-VOi). 'I’lu'se rehd.ioiis 
ni'o tip|H'o\liiin'l'(dy l-rin^ for 1-liis disl-ribiil.ion of lioig'hls, for the 
vnbios of l-ho mean (h^vitil-ion fouml from o(|mii-ions whon 
IIk' inodiiliiH is imdii's iiro 2‘()‘(- and I'Vd iiudios 

r('spc'(div('ly, while l-ho iiinnliors found above are 2'()l and 
I ‘Vb. 

'I’lioHO inol-hods of dosori]>iii{^' groiijis are, howovovj 
a-p))lioa])lo l-o j^Tonps whiididoiiot e(mfonii,oven approximaioly, 
1,0 l-ho law of ('i*ror. I shall now l-roal- Uiom witlioid- ilio 

ii. ssuiiijil.ion t.bal, 1-hoy <lo eonfoi’iii. 'Plio proliablo orroi* is iho 
nieasiiro id' dispc'i’sion, whieb is mosl- (piiolcly ealculatod. Wo 
<*an wrilo ilowii I, ho ((iini'l.ih'S very ra])ldlyj himI 1-nko half l-lieir 
dillormuai al. one,o. Ihil- 1-hal- only 1-akos inlo ucoounl- l-lio 
posil-ioiis of l-ho 1-wo qiin-i'l-ilos, and does not- l-ako into aooonnl- 
l,lio posil-ions of I, lie oxI.rotm'Sj bid. only l-hoir .si/,Oj and, dopoiid- 
inp; as il, <Iooh only on l.wo (jiianl-il-ies, is liable l,o a large 
anionnl. of iiec.idonl.al error. The mean dovial/ion on I, lie ol-her 
hand l.aki'S iiil-o ae.oonnl- l-ho posiliou ns well as I, ho niniiber of 
all iho (piaiil-il-ies, and is 1-horefore less liable lo aecidental 
error, and also il- does noi, l-ako al- all long l.o ealenhil,o wiUi 
simple iinnib(U's. 'i'lio modulus and l-ln^ sl-amhird devial-ioiq 
again, l.aki' inl-o ae<*onnl- every obsorva,l,ion, bid, l-hoy give ('xl-vn- 
wi'ig’lil- 1.0 l-lmso whiidi are a groid, disi-anee from l,ho average. 
In some eases l-hal> Is rig'hlij in ol-hors iL is iiol-. 11 we are 
liasing argannonls us l-o (.ho grou]) and l-ho shape of l.he gi-on]) 
on probal)ilil-y, 1-hon very likely il, will bo eorroel, l,o give l.his 
o.Nl-ra wi'ighl- l-o an objool- whieb is far Irom I, bo avoriigo, lor 
l-ho farl-bor from l.h(» avm'ago 1-bo loss l.lie probabilil-y, and in 
some <*ases l.lii* jirobaliilil-y dimiiiishos voi'y rajiidly as Ave move 
from l.lie avm'iigo. If wo are not- going l-o make (issum|)l.ioiis 
alxml. I, bo Hhajio of l-ho eurvo, nor a])ply l-bo ]n'ineiples of 
nrolitdiilil.v. I ilo nol- know l-ha-l wo sinill (ind a.ny jnsl.iliealion 


for takino’ tbo squarOj ratlier than the inoaii; clovialiion. Ah a 
rough rule we may say that wo pass fippropriaioly from i.lie 
probable error to tlie mean errorj and from tho moan orror 
to tho standard error as the curves with Avliich wo ai’(i cloaling’ 
become more definite and perfectly continuonsj and 
approximate more and more nearly to n curvo with a dofinito 
algebraic ccpiation. For very rough moasurements wliicdi arc 
not continuous and which aro not to bo correctodj tho ])rohahlo 
error, measured as half the distance hetweeu tho cpiartilos, 
will very likely be the best measurement. As iho curvo 
attains a definite shape, and as wo are ablo to treat th (3 
observations as more and more continuous, it will bo woll to 
take the mean error, and finally, if wo have a pcrfoci, 
algebraic curve, then very likely it will bo most correct to 
take the “ standard deviation.” 


MKASUREMJiNT OF SlCEWNnSS. 

Now to pass on to nn symmetrical curvoH. Wo have 
obtained by one of tbo averages tho position of tho curvo and 
by one of these measures of disi)ersion ono moasuro of its 
shape. We shall now obtain the moasiu’e of its want of 
symmetry, or briefly, of its skowness. Most curves have sonic 
degree of skewness; but in .some cases it is nogligiblo. 

As an example of a carve with considovablo skownoss, wo 
may take Biagram III, on p. 6. The curvo is elongated to 
the rigid; the mode is to tho left, the centre of gravity to tho 
light of the median, This is the general order of thoso throe 
averages. If a skew curvo is formed by stretch ing a 
.symmetrical curve to the right, the stretching shifts tho contro 
of gravity, relatively to the median; or, from anothor poiiil, of 
view, if a curve is heaped up to the left and strotcliod to tho 
right, experiment will show that the line through tho modian 
is to the right of tho highest point. 

T-hero are very many possible way.s of measuring this 
skewness. One obvious measuroment is simply tho distanco 
of the centre of gravity from the median. Anothor is to iiso 
the quartiles. Call the positions of tho quariiles, Q,, Qg, tho 
position of tho median, 0, of the mode, M, and of iho contro of 
gravity, 0. In a symmetrical curvo the distanco QgO is eqnal 
to the distance OQ,, whereas in a skew curve it will nob bo. 

11 a .skew curve stretching to tho right, the upper qnartilo 
to the right IS further from the modian than the lower 



iiiiul llui dilVormnu' l)o1.wooii ih(3S() two intmHiiroH will 
I'oi’in iuiotlioi' iiiofuiH of OHLiiriaiiiig its HkowtuiHH, MNk' thivd 
iiioiliod i.s to (iiiJco tlio (ii’Ht ])()\v(n'()f tho (.loviaiioiis, mid tioiiijmro 
(iho (i.xcoHM on ono side I.Iks (kmiI-vo of gravity witli ilii'' d('r('ci 
on (. 1)0 otiior. Tlio I’ourtli inoldiod ia to talco tlui tliirtL powor 
of tlio dtivijd-]i>nH and (iOiiHidor its absoluto magnitinU'. All 
thoKO inotliodH Itiivo tludv iisoh. i iivopoHo to doal witli Uiivo 
of tlioiii. I will Ilrst talco that wliicdi Is uritliniotioullj'' tlio 
HiinploMt. Tlio HiiiiploHt nicaiMiiromoiit, that wliioh you cam 
i'.aloiihi-l,o aliMOHt inHtantly^ is I ho clin'orinico hot woon tlio 
dintaiiooH from tlio (piartiloa to tlio niodiaii. Hut that g'iv(3H a 
ooiioroto (iinintity ; in tlio caao lioforo as ko many inolion; 
wIiorniiH it in oonvoniont to iiioasuro tlio tikowiioHW iim an 
al)Moiuto (jiiantity, on a Kcalo Crom -I- I to —1; and wo niUMt 
timin' 1 * 0 1 'u i'(mIiu}{) tliia ooncruto quantity to an ahaoliiUi 
(|iitin(a[y. ddio ]n' 0 |)or nioihod of doing tliat is to divido it hy 
tlio inoduliiH, which ia a concroto ([uautity_, in tliia onwo ho 
many inclioH. The one divided hy tho othor gives an absoluLo 
niouHiiroiiiontj wliitdi would sovvo to muasuro tlio skowncHH. 
Hut it is hottov to miiltijily that niuasuroniont by tho uoimtant 
H'2i) (hoo p. HO, Imlow) la'I’oro using it, to bring it into 
conl'oniiity with tho thoory of proliability ; in the satno sort of 
way UH tho ninltiplication of the Hoooiid nun non t by 2 to got 
iJio iiiodidiiH lirliigH tho staiidavd doviation inlo oonforniity 
with tho motlioda of jirobability. 

Another n-nd yot Hini])]oi‘ motliod of inoasaring alinoat 
ovaotly tho Hanio (|uantity, is to divido tho dilforiaico ludcwoon 
tlioHO two quail titi OH by tlioir sum, that is to say liy twioo tho 
])robablo orror; tium if wo multiply that by JMd (hoo p. HO, 
hohiw), wo sliall obtain tho sanio nioiisurouiont vmy noarly 
as bol'oro. This niothod snpplios a good roiigli nuniHuroiiiont 
which is very rajiidly oahaihdod ; wo write down tlio luodian 
aiiul tlio two (piartilos, oahuilutiiig thorn roiigJily or by ono of 
tho nioro ooiuploto motlmds given above, and at ouoo writo 
down the ])robablo erroiq hy this moans tlio skownuHs of tho 
gi’ou[) oiiu bo oalouhitod in five niiimtos. Hut this nimiKn ro i non t 
do])oiulH on tho jiositioiiH of tliroo points only, wliiob are 
Huiijoot to aiandontal orrovH, unit tlio jiartH outside tho (piartiloH 
have not muob iidliionce on tlio roHidt. 

A nioaHuro, which is inlluonood by all tlio iUmts, iHolitainod 
by taking the tJiird mom out about tho contro of gravity',* this 
in itself is a iiioaHur(i of tlio HkoAvnoss, but it is not of tho 



right (liiiiensions, for it is a concreto quantity of tlio onkiv of 
n cilia', as tlie deviations liavo been cubed ; to roduco it to an 
a 1 ISO lute quantity it must bo divided by Galling' tbiis 
measure j, vx' liave which in the group g'ivoii on 

p. 21 is equal to + ‘016. 

Ill a curve whicli is iioavly syiumotriciil and apiiroxiinutoK 
to (lie curve of error, the distance between the aritlnnotio 
average and tlio median will bo } jc, and tbe distance botAvoou 
the arithmetic average and the mode will be jr, and tlioso 
relations supply a third method of estimating’ the skoivnoNH. 

DlAUBAiM YIII. 


Daily WiKjes of Dolr/ian Goal-miners. 




Pil'Hl, Mm iikhIiiIiih^ mid LIumi {iiiIciiIhUj IIio position oT 

oildior iihmIo or iiiodimi mid (Jui itril-imiolio avoriif^'ci, divido tlio 
dislmico by <■ tn* by r, uml wo obbiiii j. [bit iJnit is not sin 
jUMMinrto iiiotliod, if wo iiso tlio iikmIo, wliioli cminot bo 
pri'nisi'ly dotoriiiiiiod j wddlo if w'o imii tlio Jiiodijin, wo tiro 
{[opondiii}’’ iitnni n, siiip'lo poHition. Tlio formula) to bo 


proforrod iiro j 


oirrus'*'' J=^"' 


tlio fortiior 


W'j 


porliiips wiinii iJio oiii'Vo is not iipproxiiinitoiy (dio oiirvo of 


oi'i'in*. 

'Dio tuljoiiiiiip* Dlip^’i'iiiiis illitstrti.to tlu' ]n'}M‘Li('!ii iisi' of tlio 
toidiiiioitl (pimititios wliici] I liao'o now disoiisHod. in IHbO 
tlio Ib'lp'iiiM (t()\ onimoiit iiiidorl.oolc mi Indnslri'inl Oousnsjaiid, 
miioMfV^l' ‘dJii'i’ tiling’s, tlioy collooiod li^airos of tlio wn^'i's of 
niosi- of tlio woi'kpooplo of nol^’diiin. VVt' liiivc' lioro in 
p-riipliic fonn I bo daily wiip'os of l-lio Ibdp’imi ooal iinmn's in 
iiSDd, A Hii|)[doiiiontiii’y oinpiiry was oouduotod in iDOOovor 
iioii.rly tlio siiiiio iiroii., iiinl tlio rosiilt is ^'ivon just bolow, Tbo 
iiiolrliods wo liiivo dovolo})o(l {^ivo ns a I'lijiid inosuia of 
ouinpariiip;' tlio ri'sults of tlioso two I'lnpiirios. It is tbo 
nuilrii II foldin' (i^’nros only witli wlnoli wo liavo to (loal at 
]m)soiit. Tlio avonijii’o iiiovoiisi'd from Jl’OH fmno.s to 
b'l'KI iVmios botwi'oii tlio datos ; tlio iiiodnliis from J '20 to 
2‘()'t7 fraiMis, tbo skmvnoss oliaip^'od from a nof>’ativo skowiioss 
of ~-'l() to a positivo Olio id' '22, d’hoso tbms statoinoiits 
rip'btly nndorstood imd iiiiorproUid ^'i\m in a briof form tbo 
roMiilt of tbo OmisiiH. 'I'bo iivorisp’o Ims inoroasod, moro nionoy 
wmit ill wii{jj{'H, iiml tlio mndiilns and standard doviiition lias 
iiiori'iiMod Vary niiKili. Tlioi'o was a ibivolojniuiiitj tlioroJ'oi’o, 
of wa/-^' 0 H a\va,y from tlio imirap;’Oj oithor by liif^'ldy skillod 
worki'i's iiioroiisiiip; tliiiii' wiip^’os <,p‘on.t1y, or by a liody of 
mislcillod woi'kors ooiiiMig into oxistoiico. If yon look at tbo 
oiirvo yon will hoo tlio disporsioii is oliiiifly incroanoil to tlio 
rip'bt, and tliat iiioroasoil Htmidard doviutioii is duo oitlior to 
tlio inohision of a liif>’lior ^'rado of workmoii tban bad boon 
imdudod Ijol’oro, or to tins (‘not that tbo liiKlun' {^‘rados of work 
bad obtainod a groat iiioroaso of wagi's, I um iiioliiiod to 
tliink it ]» 0 HHil)l(s tlnit tbo inoroaso of disporHion is partly duo 
lo tbo orronooiis inolitsion of poojilo in tlio sooond ominiry 
wliioii woro not iiiolinlod in tbo first, but I bavo no nionns of 
going bobbul tlio (igiiros. 'I'liu oiiiuigo of j comas from tlio 
same sort of ronsou that a bodv of skilloil workman woi-n 



obtaining higher wages, or that tlio nnmbov of slcillofl 
workmen had increased. Either of tlioso jnoans would 
iticrease j in a positive direction. Tliis n.so of tlio lotter.s 
may 1)0 left fur coii.sideration. 


Returning for a moment to the use of doviatioiiK in 
connection witli tlic median and aritliniotic average, L have 
to point out the curious relation between the two. 'J’lie 
arithmetic averago is that quantity from wliioh the .sum of 
the deviations i.s nothing, and the sum of tlio square.s of tlio 
deviations the least possible. The second ro.siilt is obtained 
instantly from the formula already given, fjui — '[’ho 
sum of the squares of tho deviations from tlio aritlnnetio 
average is fi 2 \ the sum of the squares from some other origin 
is fii] and from that formula /j^ is always les.s than IPho 
median on tho otlior hand makes the sum of the (irst jiowor.s 
of the deviations a minimum, and the .sum of tho zero powtms 
zero. If we take the zero power of tho deviations, each 
deviation is replaced simply by 1, and thon from the doiinitiou 
of the median wo find the sum of the zero powers measured 
from the median is zero. That the sum of the first powors is 
a minimum can bo readily demonstrated, most easily by an 
analogy. Suppose that it is required to run from a tolopliouo 
exchange separate wires to everyone of n places in a straight 
line, where should the exchange be placed, so as to use tlie 
least total amount of wire ? At the median position. Eor if 
you move from tlio median position to the right or to the left 
you will find immediately that you are adding more wire than 
you are subtracting. Supposing there are 20 stations, and 
you have a position between the lOtli and 1 Itli j if you inovu 
to a position between the lltli and 12th, yon have to incroaso 
your distance from 10 stations and diminish it from 9, in every 
case by the same length of the wire. Tho wires correspond 
to the deviations ; and the sum of lengths of tho wire.s is tho 
sum of the lengths of the deviations. Consideration of this 
illustration will show that the sum of the doviniions is a 
minimum when they are measured from the median, hut that 
the median is not quite determinate, for if thero aro an ovoii 
number of stations tlie sums of the deviations raousurod from 
all points between the two central stations aro the same. 



MMASlJliEMl'lN'r OF (JKOUPS. 


Til rill) IJCOTURK 


'rilM OuRVI'l 0|.’ I'jRIMW. 

rii 

I II hi Hiibjiiiil, (Iin(!UHHO<l in Uiin soclioii m I'lill of loclinioril 
(IiIli(!itli/i( 5 H, ruid il. will lu' impoHnilrlo in covi'i* (.lin Hiihjcol, 
ruliKjiuiXoly in Uio hIuji’I- hihkui iillol/k'd l-n il.. ll. ninsi, l.lii'n ho 
I'lrg'nrdoil tin oonl'iiiniiif*’ mUior n. Hinnituivy of Mioro iniporitinl. 
poinliM connooUMl wil-li (/Iio l.lioory of orror, wliiirh 1 slnill Inivrs 
1.0 DHO .sul)KO(]Uonl'ly. VYliilo nin>kin^' il. tm ooin])lol.o oh poKHiblo 
in il.Holf, in Hovonil (riinoH 1 hIhiU luivo 1.0 aslc iiocopUinoo 
wil/houl. proof of roHiill-H which 1 nhidl find il. nt'ccHKnry 1.0 use 
ill a I'ul.iin^ dal.o. 

Amoni^ Lho varioiiH HliapoHanHuimid hy groitpHofobsorvations 
of any kind which arc (ns in (.he groiipn already (.aktm) 
grouiaid in a more or Iohh rof^nlar way abonl. lho c.cnlral lino, 
Ihoro is oiur dislribnlion of l.ho various dovia(.ions abonl 
lh(Mi‘ { 5 cnl.rc whicli is roi^'ardcd as normal, and the curve 
roirrcsmtlinpc il is called llur curve of error. And il is lire 
dednclion of lho oqnalion of lhal rlislribnlion wliich I have 
lii’sl 1.0 deal wilh, Aflor wo have llic capialion wo will discuss 
lo whalo.Nlonl lho normal cnrvrr is aclually I'omnl in lho kind 
of slalislics wil.h whiidi wo deal. 'I’lio normal curve can be 
oblainod from lho Hlalislics found in games of (dmnee, or 
from lho slalislirrs which may bo oirlaiuod by comiling Ihe 
(Mjcurronoo (d’ H[io(jiIlo<l <ligil.s in mallmmalioal lablos, or Jrom 
anlliroprunolrio moasuromonls, or again from somo groups oi 
social slaliHlitis and from somo groups ol vital slulialics. 'riio 
doduclion of llui orpiatiou I. am going lo lake is lho only ono 
which f lihink hauls ilsulf lo purely algebraic Irealnionl. 
Olluu' doduclioiiN dopoiul ujnm lho use of (lilforonlial calculus 
or oven of lho Ihoory of fimolions. 

JjoI. uh couHidor some ocourronco for ■which lho chance 
is^), llio oliaucc againsW/, so Ihal^^’I'V”-^’ 



tliat the event whic]i may or may not give tho oGoiirroma* takes 
place n times again and again, and tlmt in oaeli ;/ tiinos wo 
count liow often success is ohiainod. hh)r iusliiiucc, siipposo 
we pitch a coin n times and count how many lieada arc foniid 
and tlien repeat the 7?-fold oxperimeni again and again suid 
register m eacli ciise tho number of lieads, that would give a 
series of the kind [ have in mind. For a small iiuniliur of 
oxjiernnentH, if cacli sot of oxpcrhnonts contained i 0 tries or 
any smnll finite number, it is easy to set doAvn tlio pvobubilil.ies 
of the various numbers of successes. 7 \ud it is also (‘luni’ as 
.soon as the algebra of the method is tackled, that Llioro is a 
limit towards whicli these chances tend as the number of 
experiments in each group is indefinitely increased, What wo 
have to do first is to find the limit towards which siieli a series 
of experiments tends when tho n is nicreasod indolinitely. 



llie diagram annexed represents the various ehanoos of 
the numbers of heads in the experimonts of pitching a c'oiii 
12 times. The most probable number of heads is of cour.so six, 
the least probable none, or 12, and the probability of 0, 1, 2, 
up to SIX, is continually mcreasnig. If wo erect 13 ordinates 
rejiresentmg the probability of no heads, one head, and so on 
u}) to XL heads, we got the diagram marked ( ^ ‘'b Jf we 

take another kind of experiment where tho clianeos for speecss 
and failure are not equal, e.(/,, where tho cluiuco of siicmoss 
IS -d and per orm the experiment 10 times, wo got the 
piobabihtaes of one^^ 10 suceassca 

lejneseiited by the following diagram ; 


(•3+-7)i'> 



firKl, curve is of coiirHo symnioli'ioiil^ I, lie hccoikI cum) 
uiiM,y»inruil;nc,(iL Wit at wo have io do ih (,o dodiioo Uic 
Hiia.|»(t of Uio oiirvo wlioii ilio iiidux ih iufiuifo, wlioilior ilio 
olttMUMt in I'avotiv is oiio-liall', or wliollior Uio oIkuicoh Tor juul 
ii*^*aitiHL urct iniLUpial. 

ir {> is iho prof)al)ilil.y of an ovoiiIj, and ilton Uio 

k, 

nrohaltilKiy of w hiiooohhoh in a IrialH is , , and 

^ ' I'mja-?/? 

MiicMUssHivo valiiOH of m ^’ivo Uio lorinH of ilio buioinial 
(txjtaiiHion {p -{-(/)". 

Ahhimtio Idial, up in iiilof^’ral. Loi np = }\ ii</=:.s, r't-« = a. 
Doiioio HiioooHNivo tornm by 7<o, 'ii\ . • . it,i, 

bj, 

'I 'll on Its, wbicli irt Uio f^’rouLcHi l.orin, = r/rJ^’Vy®. 


^1 — I Vl ) • • • I CacLorH 
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The assumption that np is integral inaclo above cTooa not 
affect tlie limiting form of the equation. 

It is at this point necessary to oonsidor wliioli tonus ai’O to 
be rejecteclj when n is made iiifinito. If a> is finito, if wo niovo 
tlirough only a finite number of torims from tbo groatoHt 
ordinate^ the ordinate equals tho ordinato iis> This ])arL 
of the curve approxunates to a horizontal straig’ht lino. lo 
take a numerical instance; the ohanco of obtaining 490 lie ads 
in 1;000 tosses is practically equal to tliat of olitaining f)00 
heads. On the other hand if x is infinitOj it appoars that 
lig+x is zero. If the figure is drawn, so as to sIioav Cinito values 
of X we obtain a horizontal straight line j but if an aitoinpt is 
made to include infiuite values of x, the curvo bocomoa tho axis 
of and a finite vertical line through tlio origin, 

But it becomes clear; if we examine tho shape fo3* difforont 
finite values of U; that the curve has a definito shape and finite 
curvature near the centre. Before wo go furtlior lot us take 
an analogy. If we take an hyperbola and try to includes the 
whole curve in our figure the curve will coincide witli its 
a,symptotes. In order to draw the curvo so that llio part 
between the asymptotes and the vertox can bo sooii; wo mnat 
adopt a particular scale so as to obtain the length from the 
vertex to the centre as a finite quantity. Again; if wo paas 
from the ellipse to the parabola by the process of piisliing* tho 
centre to infinity you have, in order to obtain tho fiiiito part of 
the parabola at all; to make the hypothesis that y ^\03 is linito. 
In order to get the finite part of the curve of error wo shall 
luive to select that j^art where tho ratio of 0 ® to n is finite. 
Then it will be found that we shall obtain tho part of ilio 
curve that has a definite curvature and a definito sliajm in a 

finite form. Let us assume, then, that — is finite ; and lot uh 

71 ’ 

substitute for — the quantity with tho factor 2pq. Tlio roason 

for that factor will soon be obvious. Take c^~2pqnf so that 
x—zc, “We then obtain the equation 
when all vanishing terms are neglected. If tlio above 
deduction is carefully examined it will be found that all tlio 
terms omitted are infinitesimal in comparison with tlioao 
retained, when n is infinite. 

Removing logarithms, and siting 7 j for wo have 

y—7ise~^ ='Ms.e- ? —•Ws.e 



Wo Jiro Hl,ill jii li]>or(,y l,o cJiooho a, .s(}(vl(i fov ilio ordinaioSj 
luul iH uiohL (lotivoiiioiil, 1,0 oliooso \vln{ih inakoH IhogTOJiioHt 

ordiiiiiii' ~ ~ , foi' I, lion tlio iimi boiiiulod by iho curvo, mnl 

(' \/>Tr 

(,lio axiH of m boooinoH unify; ilum ouoli paviof Uio iiroii roproHonts 
1,1)0 probitbilify of ooi'l-ain oooiimmooa, for Uio wliolo ourvo 
roproHonfa 1 , wliioli hUukIm for oorfainfy. /Vii alfornaUvo i« io 
N 

Uilco flio ordinal, 0 a.H . , ho find, fho uroaof LIio ourvo is N, 
a /tt ' 

whom hf in l,lio niiinbor of oxporiinoniH. 'I’lion I, ho aroa Hfanding 

on iMiy pari, of Mio n,xiH rojiroHonis Uio niosl, probablo numbor 

of ovoniH oorroHponding' Lo Uni, I, pari,. 

Now lol, iiH g’o baok and fako Uio fornia wo havo so far 

i-ojool,od, wliioh iiivolvo blach of Uioho oonLaiim tho 


p —7 

faofor, oonvonioni, l,o call Uiaf ((iianiiLy 2j, 

for Uion wo Hindi (bid Uial, j has Uio nioaninp; already asHignod 
1,0 if (hoo p, 21, and for ])roof hoc ]n 20). 

Ho- writing’ iho o( pi at ion with that notation, and then 
ox.] landing tho ])art wliioh contaiiiH j and noglooting tho 
powoi’H of j, WO have 
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Itini'anily soon that ](!•= — </)=_/>—• .i = i— 7 . '(’hocontro 
of grii,vity of that onrvo can l)o hIiowii to lio at tho origin liy 
iiitogration. Tho aroa of tho curvo is of cohi’ko tho integral 
of takoii hotwoon jilim in Unity and niimis infinity. Tho 
part of Uio iiitogral which does not contain j is a well-known 
dofinito intogi’id, which equals unity. It can bo Hoon that Uio 
part containing only odd jiowoi’H of m does not all'oot tho 
(hillnito integral. Iloiico tlio aroa is unity. 

Now lot UH oalouln,io tho error of moan Hcpiaro of tho curvo 
from tho equation. It in obtained by multiplying tho olomont 
of aroa //.d;)! by itH diKtanco (.r) from tho contro of gravity, 
and adding up all tho parts ho obtained, and then dividing by 
tho whole area, i,o., unity. 

It is easily soon that the ] term does not ontor into tho 
rosnlt, wliioh is therefore by integration by 

])artH, Ooiiijiai'ing this with ]). 2J, wo see that c, Urns 



calcixlatocl, is the modulus, as thei^e defined. 1'ho iliird 

moment is y.re®, da’, divided by the area, which is unity, 

integrating by parts wo obtain that tho skowiiess, as definocl 
on p. 28, is equal to the j in this equation. Tho con slants 
ill the equation to the curve of error, as written above, aro 
then the modulus and skewness as defined for curves in 
general. The average deviation, as defined on p. 24', is found 


by integrating 
not involve j. 


•J5 (T; 

y.ffi.dre-p y,x,dxj to bo ~~ 7 ^} does 
Ju Jo 


The equation in its integral form is, if for 


area on abscissa from origin to x 



the lower sign being taken when x is negativo. 

This does not admit of any simple evaluation, but it lias 
been tabulated for a wide range of values of Prom ilioso 
tables it is found that tho " probable error for tho 
symmetrical curve (where j is zero) is c x .4769, which is writton 
pc. For the nnsyminetvical curve tlie distances liotwcon tbo 

2 . 

median and the qiiartiles can bo shown to bo 

while tho distance between the centre of g'ravity and mode is 
jef, and between the centre of gravity and tlio median i.s 

g • jct> as used on pp. 27-29 above, where tho resulting’ numerical 
values are given. 

The effect on the curve of tho j term is to stretch the 
curve to the right, heaping it on the left at the same timo, the 
sort of figure wliich is indicated in the second diagram on ]). 32. 
Actual examples of the curve for different values of c and j 
aro given on pp. 21, 28. 

The tables give the integral for the argument ^ , not for 
•T, and before they can be used the observations must bo 


* See Hiu'ffess’s Mathejuafical Tables } Merriman's Least Souaccs. i) 18G' 
Sowletis mmenls of Statistics, p. 281, and p. 332 (2nd Edition); and Joiinml 
of the Royal Statistical Society. 

t See Mements of Slaiisiics, p. 331. Hence, OQ^ -OQ, = wliieli 

gives results on p, 27 and p. 29. 



I'odiKitHl to tiio {•(ini,r(5 of pi'nivily iih ()vif>'iii innl c us iinit, 
'I'Ik'II if \V{» Hiul ill I-Ih' hiiilo rliul> l.lus iii(,o^'ml rmicUon, a.g., is 
'4'r)r) wIkui Uki -l-l’liSV'', wo two io undoi'Htaiid 

Dial, ‘455 of (Jut wliolo tvi't'ii hI.iiikIh tin Uto axis of ,ij IteUvooii 0 
iuid I’MiSy of (Jio iiiodiiliiH, Tho iabiilar Hbilomonl. Uioii shows 
(,li(i viu’ioiiH I'nioUoiis of Llio wiiolo ohsorvtbUoiis wliich may bo 
oxiioi'liod (ill on iiidiillo tiiimiioi' oT oxporimonliH) bi Ho 1)(tlwooii 
l,iui iiiosl, probablo valiio and viifioiiH \'idii(!H wiLli an as.si^'iuid 
tloviiildon IVoiii iJu' ooiil-i't'. Tims wildi iho syinmoi/rioal onvvo 
of 01 ‘ 1 'or, oim-iiiiaidor of (rint obHoiwiiririons may l)o oxpociod iu 
bo tiJiovo I'lio mosi; probaltlo vtilno by iioi/ tnoi'o l/lnin ’47 ol' tlio 
inodiiliis, oiio-ldnVd by mil, inoro Unin 'OH ol' ilio tiiodulnKj all 
bill. ‘J ])(')• 1000 aro sopiM'iiiod by loss 2‘2 of ilio modiihis 
rroni Uio iikimI, probablo viibio; I, ho ohanoo of a dovialdon of 
5 tilin', s tho iiiodtibiM is loss than I in a, billion. 

(Sii|>poHiii;)' wo a.ro fp’von a sot of (ibsorvations which wo 
lia-vo i'('iison to stipposo slioiild ariso I’roni tho distribution 
dolinoil l»y Ibo Hyinnnitrioiil oiiiwt) ol’ orror, what paHicuhir 
ourv(i of error aro wo to lit to our obstu'vatunm i* 'L’ho problem 
is not rory iniportant in itHolf, but tho method of solution is 
very similar to tint mothod wliiitli nmh'rlios tho pvliicijdo of 
loast sipianis ami of sovorid other I'orniulu}. Tho only tliinj^s 
which wo have a poasibility of clioosing' ai'o tho abHoissiiof tho 
contro td' H'>'n'Vily timl tho inodultis. 

lu't ,f’i, -Cy . . . bo tlio deviations of tho observations 
nioasurod frotii thoir n,vorn[''o, Thoso[)arato ehancos that thoso 
should arise if the o({uatioii of distribution is 

I . 0 i~y I ('. 

a are . a 

V 'ir (Wir 

whore r is ji>'ivon stieeeHsivo values Ij 2 . , . 

'rile elm, nee tliat sliould oiieiir tofj;’utlior in a fi'ivon group is, 
by multiplication, 

e ".vr = 

Now on what princi[)lo aro wo to Ibid oid, tlio values of 
c and h'{ Of all the eiirvos of error from which those 
obHorva,tions may bo siij)[Hmed to have arisen thoro is oiio curve 
from which they would ariso with tho least impridiability; to 
lind this wo have to make I* a maximum, h ami a aro (piito 
iud(»pondont. Then tho dilTerontials of 1* with regard to k 

* wiiii.ii a M,., 


MikUM kifliati I 





and c must each bo zero. The first gives that h is zero* and 
the centre of gravity of the observations is the origin. The 

second shows that is the mean square of tho aj’s.t So 

v 2 

that to choose the normal curve which fits tho observations 
best, in the sense that they would have . arisen from that 
distribution with the least improbability, wo must take for the 
centre of the curve the centre of gravity of the observation, 
and for the modulus the error of the mean squaro multipliod 
by ^2 . 

It will be noticed in the proof that in a souse there is 
only one symmetrical curve of error. We can roduco any 
curve to the form ^ = by suitable choice of scales for tho 
co-ordinates ; but if we are taking two groups moasui’ed in 
the same unit, for instance, both in inches, or shillings, or 
years, then the x axis has concrete units, the unit distance 
stands at one inch, one shilling, one year. And if we take 
two separate curves both measured in inches, work with the 
same unit of abscissa, and make the areas each unity, wo do 
not get tho same maximum ordinate. The finite part of the 
curve -with the lower maximum ordinate stretches further to 
the right and left than the corresponding part of tho other. 
As long as we deal mth concrete quantities we shall find 
that the quantity c enters into the shape of tho curve ; and 
the comparison of any two curves is made by means of the 
values of c given in terms of tho unit of abscissa. The quantity 
j is independent of all concrete quantities, and is an absoluto 
measure of skewness, as already pointed out. 
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+ 22a!, — 2»^:).P= — 2 hA:.P = 0 when A) ia 0. 

H 22(a! — 

c j . P = 0, when n, since k ia 0. 


j = *00 
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Wo will m>vv iiHO Mio lioijL^'liL-HlaiinLicH {^'ivoii on p. 2J. as 
au oxamplo oC Muj inoUiod of ooin paving a sol/ of obsovvaiioiis 
with 1,1 10 on wo of ovvov, hi Llio 11 vs I, jilauo wo iako llio conU’o 
of gravity as Uio origin, naiiioly : — OV’rvt iiioJios, Tho modulus, 
by iiio mo thud of moinontH is Jk02il inohoH, wliioh is tlioroforo 
to 1)0 Cakon as (,ho nuit. Thus 50 inulios is 8’ 542 hiohua bolow 
iho tbvorngo, ill at is, 2' 857 iimos ilio modulus. Tho laiior 
numbor is oniorod undor r in ilio sociond column, All iho 
oilioi'H avo oaloulatod in ilio saino way. 'i’hon iurning io tho 



tables and finding wliat integral coimspoiuls to tho asslgnod 
values of r, in the symmoirical curve of orroi'j wo write lliom 
under the heading I'(r). So wo have that Ijotwoon tlio 
average and 59 inches *5 of the wholo curve is ohiained, that 
is to say, one-half; in the next linO; botwoou average and 
60 inches '498 of the curve is obtainodj and so on till the way 
down to between the avorag'o and 70 inclioSj when again InilF 
the curve is obtained, correct to tlio third decinuil i)!uco. "VVu 
should not get the the true half till wo liavo gone to infinity^ 
but the area of tho curve beyond does iiot amount to ono per 
mille of the whole. In this curve, for examploj •402 is the 
probability that tho lioiglit of a person choson at randoin lies 
between 07 '54 inches and 63 iucho.s, for *402 is opposite 
63 inches. The fraction of tho curve is tho sumo as tlio 
probability of the ocourrenco botwoou tho point given and 
the average. 

Tho next column, called Y{r), is obiainod in a .similar 
way from tables including tho term involving j • tho value 
of j is taken to be -l-‘06 for reasons given below. 33ie column 
following under “calculated"' consists of tho difCoreucos of 
the Y(r) column multiplied by 1,000; tho numbers so obtained 
arc the numbers to be expected appi’oximatoly botwiion 50 
and 60 inches, 60 and 61 inches, &o. Tlio following column 
“actual" gives tho actual occurrences per 1,000 iii the saiuo 
limits. The following column gives tlio dilforcmoes in the 
various groups botwoou tho calculated and actual numbors, 
Tho greatest divergence is near tlio coiitru, wlioro there are 
12 more than were calculated. In tlio hist oolnnin are given 
the differences if 1 had taken tlio normal cuj’vo instead of tlie 
skew curve. It is scon that by taking the curve as a skew uui’vo 
the Slim of these differences is diniiniHhod from 80 nor 1 ,000 
to 64 per 1,000. 

I have now a rather difficult point to take with roforoiico 
to ono of thoso columns. Tlicoretically, j is calciilatod by the 
method of moments, the error of moan cube; but in practice 
that does not give good result.s. A single observation a long 
way fiom tho average has a very groat offoct on tho mean 
cube. So that if in this number of 1,085 porsoiis wo had 
included two persons from a nationality where .stature was 
very low, or where it was very high, we should have instaiioos 
at a long way along the group wliicli would not pvoporly 
vitiate the comparison of the curve of error, but would liavo a 



very unforliiimiUi ofVocjt upon Uio nioun cubo. Insiciid of 
liiivinf*’ II. lunnogoiKioiiH gi'ou]), wo sliould hivvo a group of 
[M’oplo from ono group anil 2 jioasoiis from aiioilior 
group wliioli would uol< boloiig io Uio Hamo ourvo. Tlioro lias 
boon a gri'tU/ doal of diHouHsiou as io wluii sliould bo dono 
wiili snob almoi'inal oasos. A good way oui of Uio dilliouliy 
is noi (lO oalonlaio j by ilio aliovi' inoibod at ullj but to 
oaUsulato il. by an a yaw/dir/orf uioiliod, to oliooso that valuo 
of j wluoli niiikos l.lio mislit bm,s|.. Wo luivo already oliosonr 
so as to iiuiKo tbo iinprobaliility It'ss. Ijot us oliooso j by 
soiuo similar l.ost. Tlio moiliod J liavo adoptod luu'o is duo 
partly to I’rol'ossor Karl Poarsoii, and partly io Profussor 
I'Mg'i'Wortli. li is to obtain (iguros (not givuii lioro) in suoli a 
form tlial. it oan bo soon wliat valuo of j will make iho sum of 
tlio absoluio dilToronoos loast. 'I’lio valuo wliioli satisllus tin’s 
oondiiion is found io bo j = 'I'lio valuo oliiainod from tlio 
momonis motliod is ■()!(>, Tiiis might liavo boon used and 
woubf liavo givon a rosult slightly boitor than tlio valuo ]=:(), 

I bit I am iiiolinod to say it is bol.ior to (lahmlato j from iho 
a fmlv.riim moiliod ; I think it is ipiito as logical, and yon 
aro bouml to got a botior Hi. 

Prol’ossor Karl Poai'son has givon a tost liy which you 
<■■1111 ooiisidor tho following ]iroblom ;“Sup])osing you had a 
])opulaiiou with oortain oliaraotoj'isiios, such as lioiglii, 
(iisLributod according to a oiiiwo with a partiouhu’ formula, 
roqiiirnd iho [irobability l.liat a.n assignod distribution would 
1)0 obtained from tlio siipposod distrilmtiou. Putting it into 
a. more ooiioroto way, suppose tho oipiation of tho lioiglit 
group foi* tlio whole ])opulntioii was this oijuation with 
n--=d’02!l inches, and j='(}(l: roquirod iho jirobability iluit IjOIlf) 
persons taken at random from tho population would liavo iho 
lioights actually ri'gistorod. Professor Karl Pearson lias 
given a tablet with tho nooossary liguros for dotormining that 
probability, {laloulation from liis table on this distribution 
shows that if wo take tho symmoirical curve tho probability 
of obtaining suuli a soluction is *4; thui is io say, tho chances 
aro two in live that tho 1,1)115 persons would not bo further 
from tho supposml distribution than they actually aro. If 
wo take the skew ourvo with j = *0(), tlio probability is ‘7; 
that is to say, tho odds aro sovoii to tliroo that wo should 

* Hi'o tJouvnul of Iho llojitd Sialistiodl l^oviohj, Jiiiio 1002, pp. 337*8. 

f Hue fiondon^ hUlin, and J)nhUn Phil, Mai),, July lOOO, p. 175. 



obtain 1^935 persons as nearly conforming to this group as 
we have found. It is very difficult to arguo back from tlio 
height of a person to the expression and T shall 

not at present attempt it. 1 have shown abovo that wo 
sliould obtain tliis formula of tlie curvo of error if wo wore 
dealing \vith chances j with events whoso occurrouco waSj 
by those terms, in the binomial theorem, Jlut tlio aamo 
equation will be obtained on very many other suppositions, 
and I have only taken the simplest. 13 of or o giving’ tlioso, 
however, it is necessary to defino a froquoncy curvo.’* 

If we are dealing with a group of moasuromonts which aro 
distributed about their average so that tho niiinhor of thorn 
which lie at any defined distance from their avorag’o, say 
between ic and (.r + d.r) in excess of it, can bo roprosontod by 
a definite function, say / (a;), of that distaiioo, thou tho curvo 
which represents this function, ^.e., 2/=/(flj), is tho froqxioncy 
curve of that group, If the unit of ordinato is so ohoaon that 
the whole area contained between the ourvo, tho ordinatoa aiul 


its extremities, and the axis of x, is unity, thon / yclr,~l if 

a and h are the limiting values of (g j in many cases a and h 
are + co . Then if the quantity is selootod at random fraiii 
the group, the probability that it will Ho botwoon and .ru 


y>dx; the probability that it will lio botwooii x aiul 


U} + d,e is ij.dx. 

If we take the experiment I instanced at tho bog’iniiing, 
the tossing of a coin, and make tho number of times tossod 
very great, the chance of obtaining given deviations would 
be given by the curve of error, as already shown. This is tho 
frequency curve for the group of experimonfs. Jilvoiifca aro 
ruled by very different laws of distribution. Wo may havo 
a very skew curve, as, for instance, in tho curves of agos of 
wives in Yorkshire whore the mode was a long way to tho loft 
of the average; the smooth curve which boat fits those 
observations would be the curve of froquoncy for tho agos 
0 such persons. That is to say, if wo draw this curvo, 
lepiesentmg as nearly as possible the observed facts, and wo 
make tlus area equal 1, the area standing on tho part of tho 
a^s between the 35 and 40.year marks would represent 
the cliance of a person taken at random being between 



35 and 40 years old. If wo were given fclio ago of a man 
who had a wife in Yorkshire and we did not know her 
agOj that area would represent the chance that her ago 
would he between 35 and 40. The life curvo, to tako 
another example, is a frequency-curve. To any froqnonoy- 
curve we can assign a modulus calculated from tho second 
moment. That tells one distinct fact as to the distribution 
about the average. The curve may have the greater part of 
its area to the loft or to the right of tho avorago, and 
it may have an asymptote as in the case of the curvo 
of error j hut there is in general only a small fraction 
of the area beyond two or three times the niodulns, which 
may therefore be taken as indicating the practical oxtoiit of 
the curve. It is often useful to speak of the precision (h)} 

instead of the modulus (c), where /i-= . The greater h is, tho 

c 

more ^ precise are the predictions that can bo made as to a 
magnitude taken at random. 

If we are dealing with frequency- curves whoso practical 
range is small and whose modulus is finite, and if wo tako a 
great ii umber of those frequency-curves, or rather if wo have 
to select from a great number of things whoso sizes aro ruled 
by different frequency-curves, for example, if we mako up a 
line of a great number of pieces of motal taken from difforont 
heaps with different frequency -curves for each heap, it is 
possible to find the frequonoy- curve for tho sum of fcliQ.so 
elomoiits, that is for the length of the line you have made, f 
will put that in different form with a different illustration. 
Suppose Avo are going’ to bake 100 books, and wo can select 
them from 100 different groups of books whose thioknossos 
are bounded within definite ranges and have a difforont 
modulus which can be assigned, required tho breadth of 
100 books put togothor. Tho most probable breadth will be 
that obtained by adding tlio averages of the 100 different 
gioups. J’roni the terms of the question it is obviously very 
improbable Ave shall g’eb all the 100 boloAv tho avorag’os of 
their respective groups or all above. The actual breadth will 
havo a frequenoy-ourve of its OAvn about an average Avhich is 
the sum of tho averages of the groups from Avhicli you soloct. 
Its modulus can be shown to bo tho square root of tho sum of 
the squares of the moduli of the original froqueiioy-ourvos. 
Thus, to take a special case, if Ave are going to soloct tAvo 



t]ungs only which obey normal curves witli Lho same mofliiliiSj 
tlie moduhis for the sum is a/ 2 times tho modulus of oitlior. 
The developments from this theory are of groat practical 
importance. 

If we take one sample at random from oacli of a iminbor 
of these frequency-curves whose moduli aro not very unequal, 
so that no one curve predominates, and add together the 
quantities so obtained, then the quantity obtained obeys the 
curve of error itself, whether tlio original rroquonoy-curvos 
were curves of error or not. I cannot give tho proof hero ; 
the theorem as I state it is partly duo to Laplace and partly 
due to Professor Edgeworth.* That is ono of tho most gonoral 
statements of tlie cases in wliich the curvo of orror will aviso ; 
and that conception may propei’ly be applied to tho concoption 
of height and tho causes which detormino tho persons^ hoighb, 
No single cause has very groat influenco compared Avitli oihors', 
so far as we know, and they all presumably have inoasniublo 
effects whose frequency-curves aro definite. Thus, wo might 
expect CL prion* the frequency- curve of heights to bo tho curvo 
of errm\ 

Another illustration is supplied by the grouping of .school 
children in a particular grado.t I took ono of ilio most 
populous grades in tho Report of tho St. Louis Public Schools, 
U.S. A., grouped tho children according to thoiragos, and fittod 
the curve of error by one of the methods i have do.scribod. 
IJie curve of error with j=:'073. Jits the obsorvatioiis 

closely. If we think of tho causes which dotormiiio the 
position of a child in a particular grade or class, L think wc 
shall find that they are akin to those I have supposed in 
my statement as to causes which load to tho asymmotriofil 
curve of error. But it would be ab.surd to go back and try 
to re-valiio p, q and n, tho quantities on which tho algobraic 
proof of the equation depended. We could find out, of courso 
what chances would produce this particular distribution j but 
they would have no necessary relation to the facts. The idea 
I wish to give is that wo can obtain tho equation of tho curvo 
of error m the form I am using it on a very simple supposition : 
and lb can be obtained from many other suppositions wliioh 
cannot be given in lecture work. 

* See Edge^sortli, in London, Sdin. mid JDuUm Phil Mag,, 1892, p. 4.29. 
t For thenumbers and diagram, ... Elements of SlatiBtica, 2ad Edition, Appendix. 
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Tiii'i Mi'Vi'iroi) dii' Ijma.st Squaukh. 

n 

fjlIIM’OSI'l ill III wo imiko n, ^Toiii mi my iiuiuHuvcinuiiLs oi' tho 
Hiuiio (|im.iitiiiy hy Hovi'ml tlin’oroui iiuiilHulHj luid Uiiiij as i« 
g’oimi'iilly ilm <’hho, Mu) iiimiHiirumoiiiH dilVov IVum oaoli otliov, 
owiitj’’ in imjiorIVM’.iitms of inHiriiimmiH, (jt liy ilio immorouK 
lumiilonial oiri'iiniHiiiiiooH iliii.i iiiiond luiy invol voiL oliHorviiiitmK. 
hoi iiK iiHSMiiKv ilmi l>lio tiioimiiri'iiioniM wliio.li (Mull'd i)o tikuIo by 
i]i(( lii’Hir ii)oilnKl lU'o ^‘I’oiijK'd ii(‘(!Oi‘(tiiij^' lo ilio ri' 0 (pioiioy-ciirvo 
I 

»/=:: (I "i“, ilioso by irlio Hooond iiioUhkI iiotsovclin^' La 

(•| v'lr 

I < 

fyr;: f! '1,,“, iumI HO Oil, (loluiiio iioruiiil oiirvo foi* ('imll 

(*;J / TT 

iiK'iliod, )duji|ioH(' WO imiko ii, iiummironiontH, oiui of oiudi kiiitl. 
li ‘iH i’0([uii’0il 1.0 liml wluii 1 h ilio iiiohIi prolmblo viiluo of ihci 
([iHiinioo io 1)0 inoiiHiiri'd. All ilio jii’h wo iiro doiiliiig with 
aro on'oi'H in onr iiK'iiHiivoiiioiii. Kmiii ilio hoWoh of parll}’’ 
on’omioiiH nioiiHiiromoniH ii in votjuiml io lliid ilio iiio.si pvolmblo 
v'liliio. Tliiii in ilio pi'obtoiH ii in aiiimipioil io solvti by ibo 
I noil 10(1 of luilKli Ht|iuu*oH, Ah ii koooikT ({noHiioii, ii ih roqnirod 
111) (loiovmino ilio [irooimoii of ilio romili, iliai in, io niato 
ilio ])rol)iil)iliiy iliai ii in ooiTooi wiiliin aHHig-iuid limiiH. 

llol'oro I’lii'ilioi' I 111 11 Hi oiill niton iion io oiio very 

iiiiporiaiii point in ilio roaHonhig'. In ilio iviiHoning on wliicli 
ibo iiioiliod of loiiHi NijuaroK Ih biiHod ii iw asHiimod ilmi ibo 
fi‘ 0 (piunoy«ourvoH aro nor mill ourvoH (A orror, jih writ ion above. 


If the frequency- curve is not a normal curve of error tlie 
method breaks down at the first step. 'J'hat I shall have to 
return to later. With regard to the moduli^ wo may either 
suppose that we know them by some a ’priori method^ as is 
sometimes the case ; or that we know tliom by having* mado 
similar experiments at some other time, if wo aro dealing 
witli a group of height measurements where the modulus is 
three inches generally; or we may find them fi*om the 
experiments themselves. A useful way is to repeat the moasure- 
ment by each method, say, 100 times, and from tho internal 
evidence find out what tho moduli are. Wo assurao that tho 
moduli aro fixed quantities, quantities which we cannot afioct, 
and that they are known or previously determined quantities. 
•'■•.Wliat is the probability that a certain series of errors should 
result ill n ohservation,s^? Let aJi, a’2, . . , . be the diiforoncos 
from the imkiioivn true value which arise from n diiferont 
methods taken in one series ; what is tho probability that 
those particular n deviations will occur at once ? ^fho proba- 
bility is obtained by multiplying* together tho probaliilitios of 
tlieir separate occurrences. The probability of tho error aii 
occurring, when the modulus is Ci, is from iis enrvo of 
1 

frequency e ciL The probability that tho n will all 
occur is obtained by multiplying n such quantities together, 
'll Hero the only variables ar(3 

CjCa . . . . c„ 

the X s. Now, that probability will be greatest when tlio 

index of e is greatest, that is when is least. Thus, from 

all the possible values of the unknown true inoasuroment, tho 
system of errors which wo have found would arise with the 

least improbability when 5 ) ^3 is made tho least possible. 

That is the statement which is at the basis of tho method of 
least squares. In the particular case, when wo tako all tho 
the observations by the same method with the same curve of 
frequency, so that c is the same for all the observations, tho 
minimal condition is satisfied when the sum of tho is a 
minimum; and wo have already seen that that sum is made 
least when the unknown value is taken to be the arithmetic 
average of the obtained values. Let me re-state this theorem 



in otluu’ \voi'( 1 m. vSu|)])«)H(> wo Hiiui io nuMisiivo a particular 

objoct by tlio HiuiHt moth 0(1 aj>‘aiii and again. 'L'hoiij tho 
nu^nHiinMiionis W() ol)tain would (joiiio with tho loast improba- 
bility wlnui tho HU 111 of tlio HijiiaroH of tho doviaiiouH is a 
iriiiiiiiHini ; and that condition ih Hatisliod if wo take the 
aritlniiotio avoraj^o of our iiioaKiironiontH io bo tho unknown 
trnu (jiuintity. 'I'hiH Htatoinont in a partioular cano of tho 
niothod of louHt H((uaroH. 

WIh'Ii w(i have g*riiH])od that initial ])rinoiploj tho roat of 
tho inv(’Hti{i;aiion is only a inattor of tho dilTorontial oalouluH; 
thoro in nothing’ Hpo(!ial about it. Wo have to writo down all 
tho ('(juationH that (ionnoot tho (]nantitioH wo aro inoaHuring’, 
and til 011 by 1 -ho onliiuiry prmuiHHOH of tho dilTorontial calculus 
ox])roHH tho (JonditioiiH tliat tho Huin of tho Hipiaros of tho 
orroi’ri h 1 ui ,11 bo iir niiniiminij and tlioHO will give enough 
o(|uationH to hoIvo for all onr uiikuowuH, 1 will illnstraio that 
algobraioally by a particular oaso. 'Pako tho case with which 
wo have alroiuly dealt, nanu'ly, that in whicli wo had tho agos 
of tho wivoH in VorlcHhiro. Tluu’o wo obl-ainod a soinowhat 
irrog'iilar oiirvo roproHonting tlio nunibors at di (Tor out ages, 
and wo HUioothod that ouvvo by putting parabolic oiirvos of 
tlio fourth (logi’oo through variouH pointHj and it will ho 
roinoinbtu'od tliat wo had to oliango tho oonstauts iu our 
01 Illation accord iiig to tho lairtioular group of fivo points 
Holoc.tiMl. Now lot iiH HHHuino tluit WO liavo a ]iaraholic 
oijinition of tho third dogroo in thin ruriti, 

ij ~ I'll, -b «|(»j -I- -b < 

'PliiH c(|iial,ion has four unknowtis; wo can thoroforo make it 
piiHH through any four iiHHignod points, but wo cannot make it 
]uiHH tlirougli live iiHHiginul poilil.H. Sii])])oso tliat wo wish to 
dotormliio an oipiation of tliotliird dogroo which will pass near 
tho live ]iointn, then wo will apply tho niothod of least squares to 
tiuit pro b loin, hot tho (so-ordinatoH of tho actual obsorvatioiis 
bo (iiiu Hiul so on. hot the oorrosponding points 

wliioli wo aro to find on tliis jiarticuhir curve bo (iTi, ^i), 
and HO on. 1 'ho ])oint (<r|i/i) will b (5 near, but probably not 
00 in Old 01 it with, tlio luhiit ((Hi?Hi). 'Pho dilloronco botwoon 
mi, tlio obsorvation, and i/i, which would bo givon by tho 
our VO wliloli wo liavo not yot dotoruiiuod, is tho orror of tho 
obnovvatiou. Wo iiro to doLoriniiio tho constants so that tho 
sum of tho sijiiaros of those errors shall Ixi least, Writing 



that a little more full^, and substituting for ij in tornis of aj, 
we liave that 

SJ ( »ii — rto — « “ ^ 24 — «3^?) ^ 

IS to be a minimum. 

In that expression the variables are the four a’s, which 
have to be determined so as to make the expression a 
niinimuiii. Therefore we must difPerentiate that expression^ 
wlien it IS written out, with respect to ^2) and 

equate these partial differential coefficients to fioro, obtaining 
as many equations as wo have unknowns. Then wo have to 
solve the equations so obtained. 

After a little siinphii cation the following equations arc 
obtained : 

5 . ff 0 + . da + Srcj'’ . Ua — = 0 

Sa5i . «o -f . Ui + . Oa + 2er P . da — = 0 

Sa’ih c(o -h 2a;d • d- S.r / . «2 + . as — = 0 

. do -f SiTi'* . di + S.ri® . 02 + . «a “ —■ 0. 

The chief thing I want to saj'’ abut theso aquations is, 
that they are so complicated, and a solution is so lahoi'ious, 
til at they must he put out of court for all ordinary 
calculations. If you wsh to construct a now table which will 
be of some genei’al use, it may bo worth while to go through 
the solution, but not for any single jiraotical piece of worlc. 
Every one of those .separate terms, Sajj, &c., have to bo 
calculated arithmetically, and the equations havo to bo solved. 
Even in this simple case we have four equations each containing 
four functions. In Merrimaids Method of Least Squares,” 
the simplGiSt methods for that evaluation are given. Many 
terms di’op out, and the evaluation is possible ; and in some 
cases we can so choose our origin and take advantage of 
certain points of symmetry in the equations, that the work 
can be simplified. In this particular case a simple solution 
has been given by Professor Damvin,* 

Fitting Forjiul/e to Observations. 

Before we look for another way, let us consider again 
whether the assumptions on which the above method dopond 
are justifiable, or will justify tho great effort wliioli would bo 

* See Dflvwiii, “On PallMc Measures/* London, ISdin. and DuUin 
Mag., July 1877 j used in Lllemenis of Staiistm, pp. 25C. 257. 



lUH’OHHiivy U) Holvo Ulo oquiliions. f iluuk iL will bu foiiucl 
Hull ill goiiuriul ilioy do not. Tf wo look bade ilirougli tlio 
ar^miioiit, it will bo .soon Unit tlio origin al assumption is tluit 
tlio (lill'oroiico bolwooii 7?i, Uio actual nuinboi' of povsoua 
ob.soi’votij and tlio iiuinbor obtained IVoiu tho oquatioUj 
bolonf^'s to tho iioTinal enrvo of fi’ 0 ([iionoy j and so in ovory 
cuHO wliovo tlio motliod oT least squares apjilios wo liavo an 
obsorvod inoa.siii'oiiiont^ and wo obtain a Uioorotioal inoasnvo- 
luont, and ivo assiiniu Unit tho dilTorunoo botwocsn tlio two 
bolong.s to a normal oiirvo of frisquonoy. lie Coro we oan 
iiiako that assninptiou wo iinisi voril'y that the condiiionSj iindor 
wbioh tho iiorinal oiirvo of frotpienoy i.s obtained, arc satisliod. 
Wo tiro not in a position to do tluiU it' wo dopoud only on tho 
al^’obrtiio proof givoii aboro, without invostigafciiig’ tho 
dod notions of tho o([uation of tho curve of error roaiing* on 
othor liypothosos, I hit to luy mind thoru i,s no proof yot 
given which duos show that tlio normal unrvo of error will 1)0 
obeyed in the civciiinstaiico.s J have just inontioiiod ; and 
Professor Ktirl Pearson luis sliown tliai in very many instances 
tho uorinal enrvo is not obeyed. iSo the theory is at any rato 
dlllicult to ostablisli and is not supported by miivorsal 

(nxporionco. I think, with all tlie doferoiico that is duo to 
Ih’ol'ossor ICarl Pearson, that tho mattor jmt wants more 
praotioal oxporiouco liofovo it can bo fully docidod, It would 
1)0 nil safe in Uio presont state of the avgmnont on tho one 
hand to say that the normal curve of froqiioncy may bo 
expected j or on tho othor hand to .say doliuitoly that it is 
not to bo oxpootod, because it lias not boon iinivorsally 
foil ml. '^riiat is too diflieulfc to deal with at all thoroughly 
boro. 'Plio reason J have gone so far into it is this: if tho 
method of least st|naros is very difficult to apply, and if it is 
noitlior supported suffieiontly liy thooiy nor by oxporimeiit, 
then it sooms expedient to try some othor inotliod. A purely 
einpiricul niethod would bo this: Insload of making- the sum 
of tho Htj^uaros of tlio deviations a ininiinuni, iiiako tho sum of 
tho (irsl powor.s of tho cloviatioiis, all roekonod as positive, a 
mini mil in, that is to say, roinovo tho square outside the 
bracket in tho expression oii p. I<8. J3ut it is nob at all oasy 
t. ) make that sum a minimum, because all tlio terms have to 
bu taken as positivo, and wo do not know until wo havo finished 
our work which tonus are naturally positivo or which terms 
are nog’nlive. Professor iUdgoworth has given a motliod of 



getting the solution when there are only two unknowns.* 
Wlien there are three unknowns T beliovo thoro is as yoi no 
practical solution. 

Another method, .still taking tlie method o£ least squares 
as the basis, but ayoiding the very complex solution, is to 
choose the coelRcionts, .so that the curve will jms.s through 
exactly the four points assigned ; and thou ro -calculate tlioin, 
so that the curve shall exactly pass through four oilier 
assig-necl points ; and so continually calculato again and again 
the coefficients, getting a series of curves. Tlion froin the 
various values of the coefficienis so found, choose thoso 
coefficients which appear to give the best results. It is really 
a, makeshift method. I think it has been often employed, 
and the results have been very satisfactory. If, by one method 
or another, you get coefficients which make the tliourotical 
curve pass near the original curve, it docs nob maitor by 
wliat process you have got them. Such a method as that, L 
tliiiik, is in general use for approximating' to the poinilaiion in 
iiiter-ceiisal years. I think the Census Office has never 
published this method ; but as far as I can find out, Llio 
method employed is as follows : Supi^osing coriain points 
represent the population at the various dales at whioli it is 
exactly enumerated, then if, as a first hypothesis, we assume that 
the population increases in geometric progrc.ssion between two 
enumerations, wo obtain a simple curve passing from one point 
to the next. Then assume again that from this Cons us to tlio 
next there is another increase in geometric progression, and avo 
find that the tAvo curves never have exactly the same constants, 
ilien obtain some method for passing from ono curve to the 
other Avithout a sudden break of curvature, reject the parts of 
the curves near the Census years, and roplaco them by a curve 
winch gradually passes from one to the other, ^Pliat is a jjurely 
empirical method, and I think it is the ono adopted, It is in 
some such A\^ay as this that Ave can go to Avork if the method of 
least squares is too complicated. 

The third method, to which I Avish to call aitontioii vory 
particularly, proceeds in quite a different Avay. We tabulate 
oui observations as before, and Avi'ite doAm the equation of a 
curve Avliicli is assumed to lit them, Avith imkiiOAvn consta.utH j 
calculate from the observations the moments— first, second, 


if.r “ 0“/ Now Method of deducing Obsorvatious/’ rhil. 

Mugt 1888 j used m Journal of Ho^al Stalislical Soclelti, Juno 1002, p, 3-11. 



(Jiinl, fniii’i'li (itH iiiiiiiy iiM iiro unknowiiH) — iikouL llio 

(U'liii'o (>!' by Uu' iii((I. 1 i(hI. usod n])ov(', luul cakiilaio 

(,|i(> iiioiiioniM I'roin l-lii' iisHiiUK'd in b'vtiiH oC Llio 

inikinnviiH. I.Ik' nioimnil-H I'oinnl from (ko olisorvaiioiiR 

wil.li fill' 1)10111 on I'M round foi' llio iissiinmd diirvc, wo Inivo (Iujho 
iM()nil-ionM d('l('riiiiniii<'' flio ooiiHfiiniM. l'’oi' oxumplo wo may 
(iiko Mio iiiHl'iinot' iiJi'oiidy diKOiMHodj wlioii wo found a skow 
(•iir\'o of (‘I'l'oi’ lo lil. ri'i'iniii oliMorviifiouH. Tlio ^'otioml 
(‘<|iinlion 1.0 l.lit' wkow oiii’vo of on'oi- hoin^ ^’ivoii, l)y (dio liol)) 
of llio inb'f^Tii.l oiiloiiliiM wo Hlalod Mio valtio.s of ilio Ih'Hi, 
Hooond, iMid l-liii'd inoinmifM in ioi'iiiH of a mid j j wo oquaiod 
l.lioMO io iiio tnoinoiil.M onloiiliih'd (Void (Jio oliHOrvutioiiMj and 
UniM foiiiid <• and j. VV(' ti(>('d |,o oaloiiliiU' a,s many inoiiKmLs 
UH l.liot'o iiro iinknowiiM in Mio paHioidar ('<[nal/ion KolooLed. 
|<’oi' iiiMliinoo, in Mnkoliiiin’M fonnulii, Uioro uro fom* imknowiiHj 
mul wo havo l.o liiko four inommilM. In flio normal oumi of 
oi’i’oi* I. lion' iiro lovo iinknownM, ifs oonfro and ilio modnluMj 
I, wo nioim'iiliH ni'o l.ln'i’t'foro Mnllicii'iii l.o lind (ko normal ouvvo 
id' orror liy I'liin (om|.. In Mu' nki'W I'lirvo' of orror, (ko ((iiaiiLity 
j IniH lo 1)0 (loliorininod in addifion. In Idio om|)irioal 0({iial.ionH 
{'■ivt'ii by (’rof('HMi)i’ Karl IN'iir.snii in IiIh woll-kiiowii papor on 
(In' inoimnromoiil. of akow p’i'ou|)M, wliii'li wjim jiiibliHliod in tSyf) 
ill Mio I Voooodin^;'M of Uio ll.oyal Sooiol.y, ilioro ai'o four 
itiiknowiiM, and Mioroforo in g’oiioral lio noodod fonr moments. 
Ill tlio paralmlio iiil.ori)olii.l.ionH, miioIi as f havo usod in tlioso 
lootiiros, l.li('i'(i iii'o as many unkiiowns as wo like) to tako. If 
wo stop at S'", wo nood four nionionts. In iVofossor Parotok 
ompirioiil o((miliioii for tlio p'i'onpin^>' of tho incomes of tlio 
[Mioplo of a oomiti'y tlioro aro two unknowns. 'Plio 

oqiiatioii is IIS follows: if ^ , wiioro // is tlio numbur 

of porsmiH In roooipt of iiicoiiio .r^ and j\, a iiru constant. 
It is nisn p'ivoi) in a dovc'lopisl form ivitli one more 
oonstiiiit. It is Hiippnsod tinit tlio iiidox a is nearly tho same 
fni' n,ll ooiintrii'Hj wliih* A viii'io.s from (amutry to country. 
Von could obl,uiii thnso viiliu's hy the principlo of least 
HqiiaroSj or by oiinal.inp’ moniunts, This is not tho ])laco l(' 
cril.ioiao tho oijiiiition : I only fi^tvo it as an o.vaniplo of 
algobniic ('qiiatioii for statistioal |>’roii|)Iiig'. Wo soo thon how 
to obtain snlllciont oqiiiitions for tho unknown constants, 
and so wo ('oiiu) iiiiitui'iilly l.o Mu) (jiu'stion ol what is tho 
jiistilication for this motliod. I tliink I must i-ofor vm. Jn 


general, to Prof essor Karl Pearson-’s papor for tlic justifications, 
because it is his method, and in particular he has quite recently 
publiBhod a paper in. the journal Biovietrikay^ going very 
carefully into tliis ivhole method j and all I can do is to simply 
foHmv in his steps. The method deponds on a purely empirical 
basis, not on any a priori theory. By its inoan.s we do, as a 
matter of fact, obtain an equation which fits the ob.servations. 
But, incidentally, Professor Karl Pearson shows that the I'csults 
obtained are, in general, the same as those obtained b}'- tbo 
method of least squares. Without basing bis system ujion 
the coincidence at all, ho docs obtain tlio same results. The 
advantage of the method i.s, as he has also shown in tho 
same paper, that tho solution of tho equations obiainod is 
very much easier than the solution of equations obtained by 
tho ordinary method of least squares, I hesitate to go further 
into this subject because it is Professor Karl Poavsoids subject, 
and all his papers are very easily accessible. Tie bus shown 
that empirical algebraic formuhe can bo found Cor a very 
wide range of groups, and in ovoiy case ho has fitted equations 
to the groups by tho help of this number of moments. Ho has 
then found that tho equations so obtained do fit tbo groups 
exceedingly well. Groups may, perhaps, contain 30, or 40, 
or 100 measurements, but tho constants at disposal arc only 
4. If you calculato these 4 constants by any niothod and 
obtain, as a result, the equations which fit a wido range of 
observations, yon have a strong empirical justificalion for the 
method. I believe that is tlio justification which Pi'ofessoi' 
Karl Pearson gives for the method. But wo are met face to 
face with this difficult question, which it is impos.siblu to deal 
with here and now : How far ought we in such invesi-igations 
to take empirical formulte which are only justified by their 
results, and how far should wo base our reasoning on a priori 
assumptions as to the nature of error, and as to its occurronce, 
assumptions which imdorlie the theory of probability, and 
from such assumptions obtain our equations ? Should wo 
obtain oi^r equations with the view to fitting tho result, or 
should we obtain our oqiiatioiis from a priori reasoning and 
see how far they fit the results f To my mind wo have not 
nearly enough experience in the matior at present. We have 
not sufficiently tested the fitting of groups to tho cu priori 


* Siometyikay April find Deconibor 1902. 



equations, nor liave wo yet sufficient experience to say that 
tho empirical motliod is universal ]y satisfactory because it has 
lieen found to fit wide ranges of groups. At that point I 
must leave the discussion. 

Uses oe the Curve or linROR. 

Wliatever may bo tho ultimate decision in the questions 
which I have thus stated, there are certainly many usos for 
tho cur VO of error in tho form in which I gave it in the last 
leoturo, quito indepond eutly of tho discussion wo have just 
1)0011 engaged in. Tn what I have been recently saying I havo 
boon following, as far as possible, Professor Karl Peanson^s 
motliod. In what I shall say now I am following Professor 
hjdge worth’s work. J do not mean that tho two are 
contradictory in any way ; I wish to indicate that I am 
trying to summariKo the present position of this question on 
tho lines of the two most eminent authorities in this particular 
work. Por clearness, I repeat the method of generating tho 
curve of error given on p. 43. Suppose we have a number of 
frcquoncy-curvos, oach of small and limited range, that is to 
say, of great precision, its modulus being .small j let the 
moduli of n such curves calculated from the squares of the 
deviations bo Ci, Cg, . , . c„. Tho curves may bo of any shape, 
except that no lliiiio part of tlioir areas may bo at a g'reat 
distance from thoir coiitros of gravity. Suppose we tako 
observations belonging to tlio hast cuvvo, Og out of tho 
second, and so on, and add them iogotliorj tho curve of 
froquoucy for tho resulting sum is the normal curve of error 
with modulus If instead of ialdiig the sum, we 

tako any other function to wliich tho sum is tho hrst 
ap])roximation, tho curve of frequency for tho values of 
this function is likclj'’ to approximate to a normal curvo of 
error ; but wo will hove limit ourselves to tho sum. Tho 
following diag-ram and tho oxporimeufc on which it depends 
illustrate this theory. I took Chambors’ mathematical tables, 
and ohoso ihroo digits at raudoiu and took their average, and 
ropoatod this a thousand times. The curve of froquonoy of 
the 10 natural digits is a straight lino; you aro as likely to 
got any ono of thorn as any other, if you select a suitablo part 
of tho tables. T liavo r op resented that curvo of frequonoy 
by ton dots. It is limited at both ejids, its modulus is fairly 



Diageam IX. 

Frequency line for the 




I Vis',, : 'I-' (HI, itiid i(, Hiipplic's a vovy scrntro tonj. of tlio 
priiH'ipli' I Intvi' (aiinnanl.CHl, Ixhuui.so wu liavo a onrvo of 
i')’(M(uiui(\y wliicli is a,l>Holii[,(*]y diJfoi'CMii, I'rom Mui normal 
<'itrv4' c>r orror; il> jIuc'm not, apprcixiiiiaio t,o li in miy 
wlin.l'itvor. 'I'ho atilriiiil of tlio (KniiToiicc of 

VMIMOUM lllinil)(SI‘M Iir41 (,ln) ooiillhaoiitN in tlio 

oxpiniainii oC (I I i»i-| ,iV‘ I . . , H' 1 ,000. Ooiuptirm^ fhoao 
wi(,]i jv'.Hull, ol' (.lu' oxj)oi'iiiuail, wt) liiivo fJio following 
iiiblo ; 



III iH iHil toy jioiiili lioi'o lo aliitw Lliai IJioho ligiiroa iiTOAvluxi 
yo»\ w'uuld osspcnH, (u ; wlinl, I winli lo hIiow Ih, fii'Ht, Uuif 
l,|io aiit'(!('Hai VO ])i’ol)itl)jii(l(iH, wlu'ii t,lio,v avD ]>UsUi>d out>j 
l,|io oiii’Vo td' ('j'l’or; aiid, Htutoiidly, l,liul, Ibo oxporiniont 
l<nidN In (it, a imi'itiul ol lU'i'or. In Diagrinn IX Uio 

(‘oiil.iniioiiH lino wil,li <l<tla on i(- ibo l'i'(!(pu'noy whiult you 
would oxpoc’l,, 'I'lio lM'olc(n6 lini' in f.lio <5iii’\'ool orror.wiUi lluv 



saino area and modulus^ and tlic crosses aro tlio positions 
olitaiiied from the actual experiment. It is scon thali 
tliongh -we started witli a fi'cquency’cnrvo wliicli was a .straiglit 
line, that the tlieorotical curve wliicli wo ubtainod for tbo 
average of only tlireo terms selected from it is already so innch 
like a curve of error that yon would mistake it tor one, if a 
model was not traced on the paper ; and that the actual 
experiment supports the same view, 

Wo note that the modulus calculated from the squared 
deviations for tlie natural digit. s is 4’06, and that from the 
formula ^(SaV) given above the modulus for the sum of 
three dig-its should be yS x (4‘06}''^— 7'032, and for the average 
of three digits should therefore be 2 ’344. The modulus of 
the curve given by the calculated probabilities of the various 
numbers is 2‘345j while that calculated from the results of ilio 
experiment is 2'358. The averages are 4*5 (theoretical) and 
4'494 (experimental) 

CoXSTRUOTION OP A GliOUP PROM SAMPLES. 

The tliGory which I have just cnunciatedj for the proof of 
which .see the reference given on page 44^ is^ that if we start 
with any frequency-curves, and take our examples from them, 
one from each or many from one, and take the average, wo 
shall obtain a curve which becomes more and more liko the 
curve of error as we extend the number of our oxainiilos, 
and ns the frequoncy-curvos satisfy more and more nearly the 
limited conditions which are laid down for tliom. Now, that 
is not only a matliematical theory : it has very great practical 
importance. Supposing that wo take a nmnhor of samplo.s 
out of a large group, how near the truo average may ^vo 
expect to get? If the curve of frequency of the g-roiip was 
a curve of error, we can at once write down the prohabiliiy of 
different divergencies. If wo have a curve of error with 
modulus c, and we select n samples at random from it, and 
then take their average, the modulus for Lhoir sum is from the 

formula already given, ynG% and hence that for their avorago 

• ® 

The precision of the arithmetic average varies inversely 

as the sqnai’e root of the number of items, a very well-known 
principle. I vdsh to show how this theory can be adapted to 

o. ,"1®” ^^^3emrth, in Jubilee Volume of the Jounml of tlio HouaX 

Statistxcd Hocietif^ p. 186. 


ciifVC'M <.r hv(|tuMirv l.hini llin non, mi cum* of on-or 

Sn|»|><is(' l-fio orin’ini.l ciirvo of frixiiictK'y |<, Im uuy cum' 

wlu.I.'v.'r, n snrvfvors Ini' oxuinplo, I do .tol’ ussimio 

any |uirl.i'ciilitr slnipo l.n i|.. , Suppose wo f-'o (, lining], an 
<^N|K*nnK'iil., Inking, wo will s„y, -m cuunph-s al. mmlom IVoni 
il', "lid rojM'nf Mm pmri'ss linii'S. | h, 1,3,0 cxporimoul, 

jiisf diNcnsstMl m was only Ihroo, mid A* was 1000 .] Though 
Mm ni'igimil niinili(u-s (l<i nol, (,|ioy |.|,o uornuil cumi of cmtoi-, 
yol. Mm uvonigv of m of Mmin may ho oxpocl.ml l,o, when m is- 
sullii'undl.v jvn.jil.. ,• b,, Mio inoclulus for tho groii)) of 

avonigosid’ m Hnini,l<*H; Mnui may bo cxpookid to bo M,u 

mod II I IIS for Mm n.vonigi' <d‘ Mm whole itiuHS of fun Hfimploa. 
ThuM, ill Mm iibovo cxporiini'iilr, c wns k 1000, mid 

- .-(MM. ; Mm known iivorngo for all digil.a, wliioh formod 

Mm orig’iiial onrvo of fr('i(ii('imy is 'l.'r,, Mm uvorago fo,- Mm 
d.OOO Holoci.od, in 1,000 grou)i.s of t.iiroo, was d-'d-Gd; Uio 
dilToroimo is oim-foiiMj of Mm modulus jii.sL cilcnliitod j sosimdl 
a iliH’oroiioo iniglil, bo oximoiod oimo in nine Irials. 

Thus, w'lmMmr (,lii> <‘i,rvi> ol IVoipii'iioy of Mio original grou]i 
is Mm iiornml oiirvi' of t'rnir nr not, Mm ju-ooision of Mu' 
a von, go of 11, ^vroid- miiubor of Hainplos is proporl-ioiml lo Mm 
sipmro root of Mint nuinbor. 

Now lot im HIM! how to oonstruct mit nmroly aii iivorngc', 
bill, u wliolo jJiroiip, liy Mm nmtliod of suni|,los. 
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Tlin liiJiln iiMil n'ivp ilin itsuIL of iiii oxporiinonL in 

jOK'li fiHiHtriK'lioii, Tlin iiiiiiiii'iiil nf LIk' oxpcM'muMil, is of no 
iiopo]'lii)i('o Ihm'o ; I nu'iviy look l!i(' inosl. lU'ct'Hsiblo (ij^ms's io 
coiuliicli Mio ('xpi'i’iiiK'iil, iiiniioly, flio oOiciitJ {liizoifo pvicn's of 
wIk'iH. for I'lio (5-1(5 mioiiMim for wliioli (in-y ms* rooordod in Uio 
Kfiil.iwlii'iil lib'll. rmds, iiiid is>p'tird(’d Mini ns n p'l'oup of I, bine’s 
wliiidi I wiis p’oitip' io build up by Hiiitipbi. I‘\u' roinploio 
illiisiriil.ioii 1 bad lo hikoap’nmp 1 know, and Mion io iako 
HuniploM id' il, 111 ‘‘'oiioral, id' ooni'so, Mio p'riaip is noi known, 
biii ban lo bo ooiisinioiod from Mut sani])los. Tlio aoinul 
p-roiip is I liai p'ivoii in Diiip'i'am N iti Mio ooidJiiuoiis linos. 'I’o 
nblaiti ilio satiiplos, I look (llianilKU’s’ ina,Mi('iaaiioal ialilos, 
)Mid assi/i'iiod (o pindionlar mindim's, from 001 lo (I, ‘50, 
001‘iain iiiomMim, and ioidv 1 00 niimliors of Miroii dipdis ai 
riimlom, No\i, I wroio down Mio jirioos in Mio 100 inuntlis 
oorrospiuidiiip' [,o Mmso 1 00 niinibiM's, and ^roujiin^’ ilioni in 
IOn. ^ji'uiipu, idiininod I-Ik' iiiiniliors f*'iviui in Mio Uiir<l oolunin 
iiiliovi', amt also p’ivon by ibo orossi's in Mio Diap’ram X (a). 
I iioxlf Nidooiod 155 siimplos by lakiiip’ ilui lirsi 25 of Mio 100, 
and I p'roiipod ibo lip'iiros in 20iV. p'roiips, and obiiiinod Mio 
imnibors p'lvoii in Mio lil'Mi oolunin and by Mui ovossos in 
DiiipM’i'in X (n), VVliai I'lilo liavo wo for dooiilinj^- Imw noar 
ib(i iriio p;roiip ibo Maiuplo is’f In Mio iliird division, for 
iimiimoo, Ih'IpWooii iiml dO.v. in ibo wliolo p’ron]), Mu'ro iiro 
|.‘l-i iiiiiiii, 1100(1, [i.ml 21 por ooni. of Mm ari'U is boiwomi OO.s-. 
and dOw. II wo iako 1 00 I liinps ai random oni of ibo wbob* 
{••1‘oiip, bow many id’ ilmi 21 por omii. aro wo likoly lo poi V 
'I’liia ill a siinplo proldom in probaJiiliiy : if n .sainplos aro 
Inkoii, ibo olianoos Mini 0, 1, 2 ... a will oonio from a pivon 
pari, wbiob is io ibo wliolo as is yi io 1, aro ibo siii'oossivo 
oiM'llioimitu of Ibo oxpansion of (7 | wbi'ro 7 =l—y); as 
a inormisi's wo approx iniaio lo a oiirvo ol Iroipionoy wiib 
niodnlim s/liy»/a (soo p. O-i). I n ibo Miird division p = *2I , wliilo 
a, ilio wliolo iinnda'i' of siimplos in Ibo lirsi oxporinioni, is (00. 
iioro y:i/a/a. ■ ^(2 x -21 x '70 X 100) "5'H. 'I'lio dilToronoo 
boiwi'on ibo iKdiiial niinibor por lOl) in ilio proup, nanioly, 2(, 
and Ibo nuntlau' found in ibo saniplo, nanudy, 1(5, is loss Mian 
ibo imaliilus. In aJI ibo oibor oasiai in boMi oxporinionis Mio 
dilVm’inioOM aro wiiliin Mio “ jirobablo orror (wbiob is '47 of 
Mni niodubm, mu' p. Hll).' Wo bavo Mins found a oriiorion of 
ibo divm'ponoios lo bo oxpm-iod bid-woon ilm di.siriluil ion of 



magiiitiides in a group of samples and the distribution in tho 
unknown group from which they arise. 

As regards the precision of the averages of fclio samples, tho 
modulus for tho original group is about IQs., niidj therefore, tho 
moduli for tlie Eivoragos of 100 and of 25 samples, respoctivoly, 
are 19s. ~ \/l00 — Is. lid., and 19s.h- \^25 = 3s. lOd. Tho 
averages found from tho samples are actually 45s. Ad. and 
40s. 6d. which are, respectively, Is. 7c?. and 2s. 9c?. in excess 
of the average of tlie whole group. 

The experiment, therefore, forms a good illustration of tho 
theory, and on consideration it will, I think, bo found tliat 
the tlicory is in strict accordance with comm on-, sense and 
common experience. 



MIUSUIIKMI'INT OF GROUPS. 


l'’ll'"l'll I.KOTUlUi. 


(Jni{|{t'll,Ari()N Ill-ITWIIMN Two (iROUL'.S. 

IjI'jT l-lioro 1)0 i)iiirH ol’ moHsuvoinuiiliH mul ko 

oil lip To l-lio moiuhorH of 0110)1 jiiiiv liiiviii^’ soino 

ilotoi’inliiafo i‘omiootiion \vil-!i oaoli olJioi’j for oxaniplo^ suppo.so 
Mini) ilio ill’s aro tlio iif^’os of Uio wivos in 1 -hu group takou 
al)ov(i, and tlio i/’u (Jio ag’os of Uioii' liiiHliaiulSj ,i',. and y,. lioing 
l.lio ag'oH of a iiiai'riod 0011}) lo. Tliis is Uio oxainplo diHOUHsod 
bolow. Or mippoHO f)ia(> nv is Uio iig’O af wliioli a mail dios^ 
inul 7 /,. Uio ago iil; wliioJi !iih faUior diod; or wuppoHo Uiai 
ai'o na'aHiM’oinoiilrH of pliysioal oiiariudoi'isUcH of Uio aaino man. 
Or n.gain, dv niig'lil) lio a diiaUi rain, in a year in wliioli 1/,. was 
ilio ii-vorag’o ioniporaiuro, 111 is rnijuirod to niuasuro Uio 
rolaiioiisliip boiwooii fu’s and i/h so as in an s war Uiis qaosiion : 
Oivoii 01)0 of Uio Ill's, assign Uio ]irobab)o valuo of Uiu 
norms] londing' //. I^'or oxainiili', g’ivon IpIio ago ai wliioli a man 

diod, assign (»bo niosi jirobablo ago io wliic'h In’s son will Hvo. 
Or, iatcing' 0110 iiionibor of Uio grou]i of wivos ai random, sLaio 
Uio ])robabiliiioH of ilio ago of lior linsband. Wo liavo in fad 
1,0 givo imniorinal oxjirossion io snob siaiomunis as ilicso : 
A liigli doaili raio goos wiUi a low iomporaiiiro j a long’-livod 
faUior has loiigUivod sons; for two sialomonis wlioro 
(iWO nioasiiroablo ([naniiiios aro oonnodod in Unit way, wlioro 
in common ])arlanco wo ooiinoci tlioiii with simplo adjeoUvos, 
wo bavo io find a miiiioi’ioal or maUioimitioal expression for 
Uio robiiioiiHlii]). lUrsi suii])Oso Uiai ilioro is no oausal 



connection between two groups. Then if wo soloot any 
particular place on tlio axis on wliich tlio fc’s aro nioasurod, 
and mark in the corresponding' ?/^s, wo shall g'ot a group of i/h 
wliosB average is equally likely to bo abovo or below tlio 
average of all the y’s. Suppose we choose a group of wivosj 
between the ages 23 and 30^ mark in iho ages of tboir 
luisbandsj and mark the average of such agos^ if thoro is no 
connection between the agos of the one gron}) and iho ages of 
the other, the average of the groiq) so iakou will bo near or 
ecptal to the average of tlio whole group of liusbaiids, namoly, 
42 years. And so^ if wo take another jioriod and mark in iho 
various ages of the husbands wo should again find iho average 
near the average of the whole group. If tho le's are 
represented on a horizontal axiSj and tlio aro moasurod 
vertically by points placed abovo tho values of x whicli aro 
their pairs, ilion if there is no causal connection lioLweoii tho 
magnitude of the .ris and of tho y’s, tho av'crag'os of groiqis of 
the yis oorresj^oiiding to assigned iutorvals on iho axis of x, 
will all lie near the horizontal lino through tho averages oJ' 
tho ifs. filiey will not lie on it, but tho best straight lino wo 
can draw near these points will bo a liorizontal lino throiigli 
the average; that is obvious as soon as tho statomont is 
understood. 

But now suppo.se there is a causal connection botwooii the 
tivo sets of nioasurements ; suppose, for examplo, that a high 
value of X goes with a high valuo of y. d'hon if wo start from 
tho average value of <r, which wo may assume for tho moment 
corresponds to the average value of y, and pass to tho right 
and choose a group at a jilaco abovo tho avorago for tho fcls, 
the y's wliioli are obtained for that group will bo distributed 
about an average abovo the lino. And as wo coiitinually 
mark off the averages for group after group by points, they 
will lie on some curve which tends'upward to tho right from 
tho origin and downwards to tho loft. (Seo for oxainplo 
Diagram XI.) If, on tho other hand, a high valuo of x wont 
with a low valuo of y, thero is a chang'o of sig'iij tho series of 
averages would go down to the right and up to tho loft. The 
exact method of drawing a lino blirough those points I do not 
propose to discuss very miimtoly. We could draw a smooth 
line by tho methods discussed in tho first locturo, or a 
freehand curve. We can either draw a straight lino as near 
as possible to iho dots, or wo can draw a curve. X shall 



ii()li Mil' f>niini'iil nimiKi nl' llml, nivvo ; I slm,!! imu'oly 

iisKUiiK' llinij IVdiii Mi(M)l)Hni’viU ioti nr wo ctiii draw 

oiirro. AihI ninco in iniy mirins of oliHorvaiiniis Mm 
parMc.iiliir n,vt'i'H‘pin aim liiildo in Hlip’lii d iH|iIiio(mu'iil., in a 
liiiiio niiiiilmr (d' idmorviiiiiniH wo do iioi Mk'. hiokI, prolmldo 
jioitii with oiu’li iivoni^p', iiml iimsi sninoMi Mio liiann Mm way 
wo liavo (liHOinmoil. VVi' niiiy ti^siinm an nipiiilion, 
wliiidi |*-ivoM ilio avonif';o of Mn^ i/’n fni- iiin pariioiiliir valimn of 
.r; Mini in only f-nviiip' ii, |•’Oll(>nll form l,o Min Hiaioimmi, iliui a 
valuo of ji in ooiuioriiMl wiMi a viiliHi of ,t' liy a iloiorminaio 
o<{iniiinn. 

'riiin ntpiaiinii, nf coiti'm', only o’ivi'H Mio jmsiiion of (,]m 
avonppiH of Mio solooinil pmoiipn of i/’h. I'jvoryono of ilioHo 
p;roiipM Imo iin own fnM|iioiioy-oiirv('. M' wo wolooi ap'uiii Mio 
iip’os of ilm liindnimlH <d’ ilman wi voh wlio.so tip’i's nni Imiwoc’ii 
‘J!5 imhI wo ciiii (li*a\v a rroipioiK^y-c'iii'vo foi* Mini f^'imiip of 
ImHliandH, Inii Mio rmiln’o of Mini froipioimy-oiiiwo will no 
lonp'or 1)0 ni Mio avoriijip' ii.p'o of all Mio liiiKliaiids, if Mioro ia 
oaiiHii.l ooiiiioid.iMii lujiwoon ilio |L;*roiipMj hni as Mio ^'I'oiip ialcoii 
in Imiow Mio ayMini^'o of l4io wiro.s, ilio ootiiro of iliia tmrvo 
will 1)0 holnw ilio avonip’o for ilm IiusIumkIh. li is noi 
imoo.HHaryj in f^ononil, lo imiK’o niiy aiiompi io draw Miis 
froipiunoy-oiirvo poiiii by pnini, hni only io iaico iia ooniro 
and in huhio oiisom its modnliiH. lusii^ad of doaling' witli 
ariilinmiio avoruji^i'Hj wo may mpially woll iiho ilio nmdiaiiH of 
ilio fyi’OllpH. 

Wo mi/,;'lii ('iilvO, Tor oMi.nijilo, siioh a (jiiOHinoii as iliis, a very 
old ipmaiioii; Man ilio ])rioo of wimai luiyiliiii/:^’ io do wiili Uio 
initrrinp’o rai(« f In huoIi ii onso an iliai wn ploi oiii ilm prinos 
of wlioai in ililVoroiii itionihs or yoars iiionp’ ilio axis ol’ tr, 
iind pni ill ordiiiaioH Hhowiii}.}' ilm avorup'o niai'i'iiigo rain whun 
ilm wlioiii was iluii pariioular prion, mid ilio dirooiioii of ilii.s lino 
or ilm form of iliin ourvo would f^'ivo, wiililii ooriiiiii limii.s 
iloali wiili linlow, ilm aiiswor l.o iliin (piosiion, wlmiimr iliuro 
wuH a oonminiion hoiwoon ilm iwo or noi. If wo do ohiaiii 
from our olisorvaiionn iliai iimro is th lioiidoiioy upwards Lo 
ilm rip'll i anil liowiiwai'iln in Mm lol'i, or luc.n rnmi, wo liavo 
found iluii ilniro is sonmiliinp' (loinmoii iii Mio sysiuin of 
oauHiiiion wliioli pvudiioos ilm iwo sois of plinnoinoiia. Wo 
oammi say iliai Mm m’s am Min naiiso ol Mm //’s, nor viva 
lUimi, ))iii only Lhai Mm iwo jilimioinuiui aro noi alniolutoly 
indopoiidoni, 



The CoBPrrciBNT op Correlation. 


Wg have to find a numorical moasiiro of tliai dependenco. 
If the curve that we obtain is a straight lino, wo havo only to 
find a means of calculating its inclination. Before procoodiiig’ 
to this, lot us spend a few words on tho caso whon tho curve 
is not a straight line. Suppose that wo liavo sulficiont 
observations to determine by experiment and observation tho 
actual sliapo of this curve from largo groups, we could, 
without applying any further theory whatover, ostablisli tho 
connection between tho .r's and the y’s j tlio curve can bo 
plotted out, and given algebraic expression, if pos.sihlo ; and 
thou we should bo able to say that for a particular value of s 
tho most probable value of y was the ouo obtained on this 
curve. We could have a curve simply from oxporionco, and 
use tho experience with similar plionomena at another tiitio. 
For instance, if wo had that exporioiico of tho longili of tho 
lives of the children of j^arents who lived to various agos, wo 
should ho able from thus empirical curvo, to say if a maids 
father lived to a certain ago then tho chancos of tho life of 
the son are given by a froquency-curvo whoso contro was 
found from the empirical diagram, and whoso shapo might 
very likely be known also. In many casu.s, liowovor, tho 
curve of average.s is approximately a straight lino. Even if 
the approximation is nob very exact, it may bo useful to 
calculate tlie inclination of the straight line that passes 
nearest the averages. Let us suppose that wo liavo the 
equation of this line, y — ax-i-h. Consider any observation 
Vv i if this observation lay exactly on that line, y,. would bo 
If the observation does not Ho on tho lino, its 
distance from it, measured parallel to tho axis of y, is 
y, — + h) . To obtain tho best values for a and /j, which 

arc tho only unknown quantities, wo can proceed* by the 
method of least squares, and make tho sum of tho squares 
of such quantities as a minimum, Tlion tho 

differentials of X(y,. — 6)^= a (say) with regard to both 
a and h must be zero. 

Sw 

Thus -- 2txy + 2bXai= 0, 

hit 

= 2nh + 2aXft; — 2Xy = 0. 

* Soo below p. 73. 



()Ii<K)H(5 Mi« ii.vdH HO 1 , 1)0 ih’h tin (I Ulo //’a aro 

nH‘itniiro»l I’rnm l•lloil• iLvt'ni.j^'os, 1,1 ion aiul t,1io 

otjuiil'ioiiH iiH l> -“O nil (I n— v” 5 » i'()(|uivotl passes 

Mivoiif^li I'lio oi‘if?iii, iiiitl il,N ('((inil'inu is ;(/= y* a *<i!. ljoL( 7 |,o-a 


1)0 l,lio Hliuitliiril ( In vi Ill-ions oT l-ho f*T0Hps of t«’s anti, of i/’s, so 
iJnil- iumI lol, r™ j I, lion ilio above 

•' n<ri<r.i 


o<|iiii(>ioii boooiiios 


V- 


^. 1 '// _ __ f<r.i ^ 

• .)! — ,)• 
tir}) <ri 


I, bill, is, =r /• . I'-.t 

f/a <r\ 

111 ordoi’ 1.0 miiko r symiiiol'rioal, il, lias boon nooessavy Lo 
(liviilo by rri inn I w-j, l-lial, is, l-o nioasiii’o it- and ij by Uioir 
hI,iiih 1 iii *(1 (lovinl iotiH. II- is a voi*y nal-iinil Uiing* to do. Bolero 
\vt> ^•an ^vol, any nnnn'i-ioiil no in pa risen, wo must reduce tlioni 
to s«)ino coniinon inousni'o, and a ooiiiiuon unit whicb wo oan 
very rousonably adopt is l-lio Htandard deviation for each of tlio 
two thiiif^H. If wo lire doaliii;*’ with tlio (piostion I sn^^’g'ostod 
just now -the niiir ring'd riito and the prioo of wboiit — wo 
c-annot coin])uro sliilling's svitli a rate per thousand, but we can 
coin pare a ratio of the niinilior of shilling's to a standard 
nninbor o\' sliilliiig's, with the ratio of the rate ])or thousand to 
a stain lard rate per thou.san<l. Wo aro tlioii comparing* 
absolute instead of done rote (plan ti ties. Wo should got 
siniilar (upiations if wo used the modnlns instead of tho 
standard (hiviation, or tho ])rol)ahlo errors, or tho moan 
doviatioiis. h’or rapid work wo could rophico tho cr, and 0-3 
by the iirobable errors, wliioh aro proportional to tho standard 
deviations in onrves which approxiinato to tho onrvos of error. 
It is to bo notioitd that wo (Jiin express tho (pianiity r in tho 
following form : r is tJm Jivorago of such produots a.s 

r is oiillod the woOidonl of corrolatunu Tt is not 

0*1 O'!) 

dillieult to show by pure algebra that tlio quantity r so 
(lotormiuod must lie between -I* I and — 'It; and that?* equals 
~1' I, only if tho ratio of every ir to its oorrosponding* y is 

* 'I’lm IhhI fow iMungriiphs m'O BiihHluntially tho sumo ns Hiosd given by 
Mr. Yuli) ill tho Journal of the 7toi/al Hlalistical Hooiolt/, 1897, p. 817 soq. 

t* Ki'd AV<'hi«h/v of ]j. lilfl. 



idoiitically the same as the ratio of every other u? to its 
corresponding 1 /, so that the ratio ^ :a; is consiantj and equal to 

If the ratio is constant and = — — , r bocomes — 1. 

CTl fTi 

and an increase of x corresponds to a dimiimiion in y. Thus r 
is always between + 1 and — I, and between these limits 
there is a scale of correlation. For histancej wo can say that 
the correlation between two sets of plionomona is '6 or —■3. 
Of course,, when one is first introduced to a new scalo of any 
sort the numbers in the scale convoy no meaning-; it is a 
matter of experience to attach the rig-lit value to the different 
magnitudes in the scale. Perfect co]-rGlation can bo understood 
from the statement that groups aro perfectly correlated if a 
deviation of a meinbor of one always equals the deviation from 
the average of the corresponding* moinbor of the other 
multiplied by an assigned constant. If the two things^ 
marriage and wheat prices, wore perfectly (negatively) 
correlated, you would be able to establish some such equation 
as this : An incroase of '1 in tlio mai’i’iage rato is always 
found with a diminution of Qd. in the price of wlioat. Of 
coiirso, such a rigid relation is never obtained unless there is 
some physical cause binding the two things together. As tho 
ratio of corresponding pairs tends to constancy, tho correlation 
becomes more and more perfect. That must bo regarded as a 
definition of correlation. 

Now consider the sum of tho products of x and y, and 
let us write X for - , and Y for ^ . 

CTl O-g 

If there were no correlation, if wo seloctod the values of 
Y winch corresponded with a particular small range of 
values of X, we should bo likely to find a negative value to 
neutralize each positive value of Y, and the products arising* 
from that range of X^s would tend to zoro, and tho greater 
the number of terms the less the distance of their averago 
fi'om zero. But directly there is any bias towards g'otting 
the positive value of Y for this particular range of X"s, as 
Ave increase the terms avg may still get negative terms hero 
and there, but on the Avhole Ave shall get positive terms; and 
so on, all the Avay up tho scale of X^s. When there is 
correlation it is clear that the sum of the products tends to be 
greater than Avhere there is none, Thus it seems probable 



from fii’Bl principles that tlio quantity r thus calculated will 
make a good lueasuvo of correlation. 

Tlioro is an important caution to be given in the use of 
this formula. Ifj from two series of phenomena Avliich were 
absolutely unconnected, we took a limited number of examples, 
say a thousand, and worked out the value of r, we should not 
obtain exactly zero, or rather tlie chances are very much 
against obtaining exactly zero, even if there was no correlation; 
and if wo took a very small number of examples the chances 
arc very mncli against obtaining anything near zero. As we 
increase the number of samples, if there is no correlation, 
the coofRoieiit will tend more and more nearly to zero. What 
wo requiro before we can use the coefficient is some criterion 
to enablo us to know whetlier the formula is significant, 
or Avhothor the actual number might have arisen if there had 
boon no correlation whatever. Such a criterion is given below 
on p, 88. 
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The numbore given are In oveiy case tlio nearest thousands. 


A numerical example I have prepared will put the 
calculation in a clearer light, The tahle here given shows 
tlio numbers (to the nearest thousand) of the wives and 
Imshands in the County of York in 1901 at various ages. 


grouped in periods of five years. For example; if yoi* 1^^ ^ 
the husbands’ ages from 40 to 45 and look along the Hno }OU 
will see that tliere aro two wives between 25 and 80j feovon 
between 30 and 35, 25 betwoon 35 and 40; and so on. 11 
was not practicable to deal with G1 0^000 cases'; and I lavt 
therefore dealt with the thousands only, and approxiuiatec 
throughout the calculation. Tho fact that tho nninbeis inii 
diagonally down the table as they do shows at once ^tliei o is 
correlation. I have taken a case where tho correlation is iieai ) 
perfect. If we had a table where the correlation was voi'}^ ^*^**\i 
we should find tho numbers distributed in random fashion a ^ 
over the table. In such a list of figures as this it is not voiji 
practicable to take tlie arithmetic average. It is oasioi am 
as accurate to take the medians. I have approximated to fc lO 
medians for all the gvoupS; both horizontal and vertical, 3 
tho methods already explained. To take a particular ex amp o, 
consider again the husbands who are between 40 and 45 j^'cais 
of age. If you look along the list you will find there ai’O m 
all 78; and that tho median age is 40'7 years. Or if you take 
a vertical column, if you choose those wives who aro bofcwcon 
40 and 45, and look vertically downwards, you will find that 
tliere are in all 77 of them; and that tlie median age of then* 
luiabands was 43'9. 

The diagrams show the medians graphically. In the flr.sfc 
the ages of wives are measured horizontally; those of husbands 
vertically. Above the middle point of each five-year period 
IS placed a dot indicating the median age of husbands whoso 
wives* ages come in that period. ThuS; looking' upwards from 
the position of 37^ years, the middle age of tho group of wivoH 
between 35 and 40, you will find that the dot indicating' tlm 
median age of tho husbands is placed at the 38 '9 years. Yon 
see that the points so obtained lie very nearly in a straight 
line. At the top and at the bottom the line becomes a 
little bit curved, for the influences of the lower and uppou 
limits of ages make themselves felt. If wO tided to get a 
normal distribution for the group of wives who are under 20 
we should be getting hn.sbands at 13 and 14 yeai’S of ago, and 
at the other end of the scales we .should have got husbands 
at ages at which there are no people alive. Tho fact that the 
scale is limited at both ends is the cause of the deflection of 
that curve from tho straight line. We have now to measure 
the iuelination of that line, In the case I liaye taken, where 



Bia&r.vm XI. Diagram XII. 

Showing coiTelation heiiccen ages of hislands and their wives. 

Median ages of liusbands. Median ages of wive? 



Ages of ndves. Ages of liusliands. 

angeiit of tlie inclination of the line through the dots is '97 (nearly) The tangent of the inclination of the hue through the dots ia *92 (nearly; 



tlie correlation is so perfect, thoro is no difficulty in measuring 
tlie line, because if a straight line is drawn througli tliroo or 
four of those points it j^asses very near the others. Hut in 
otlier cases, it is not so obvious which straight lino is to bo 
drawn ; and then wo can proceed by tho method of least 
squares already taken, or you can proceed by the following 
practical method which yields good results : — Mark out two 
lines horizontally and vertically through tho averages of tho 
two groups, and rotate a ruler througli their point of 
intersection until tho same numb or of dots is found on tho one 
side of it as on tho other. It will bo found that that method 
gives a definite position of tho lino which passes very near tho 
points ; it is a purely empirical way ; but as the coefficient 
of correlation need generally not bo calculated witli 
great minuteness, it will in general be siiffioicntly corroet. 
It is often absurd in cases of probability to work out 
the results with very great accuracy. Hdio lino is not 
drawn in the diagram above, because it would have 
obscured the dots j but underneath is given tlio tangent 
of the inclination to the horizontal of tho lino which would 
satisfy the conditions, the tangent of this angle is '07. 
Tho second diagram is constructed in a similar way, for tho 
median ages of wives, whoso husbands aro in a given 
group; tho tangent of the inclination of tho lino through 
tho points is now *02. Tho avorag'o ago of tho hu.sbands i.s 
42-16 years, with standard deviation 12 '6 years ; tho average 
ago of the wives 40- 11 years, with standard deviation 
12'1 years. Tho statement wo have now obtained 
is of this sort; — If we are dealing with a man whoso ago 
is /i, in excess of tho average, and wo wish to know 
the age of his wife ; the value of 'lo in tho O([iiatlon 
/i,~42'lG = '92('iO“40'll) is nearly the most probable value of 
her ago. That comes at once from tho geometry of tho second 
diagram. From the first diagram wo obtain similarly ; — 
Given tho age of a woman as being w, so that tho deviation 
from the average is mi~ 40'11, then the median ago of tho 
husband group is 42*24+ '97(11; -40'11). Wo shall probably 
also need to know the curve of frequonoy for oach of tlioso 
gioups. Unless there is a reason to tho contrary, J think in 
general that we may assume that tlio curve of frequency for a 
selected group is similar to the curve of frequoncy for tho 
whole group from which it was selected. So that wo can 



calculate the fsfcandard of deviation for this pfirticiilar ciirvo of 
frequency when you know the standard of doviation for the 
whole group, ddibit in to say, wo can ascertain the olianco 
that the ago of the husband of a particular woman is 
any assigned number of years above or below the ago liero 
solocted. In the little table given abovo we can actually find 
those small curves of frequency ; for iiistaucGj in tlio ages of 
wives 30 to 35 yeans of ago the curve of frequency for the 
husbands goes as follows : — 9, 49, 29, 7, 2, 1 . 

Tho abovo is the graphic way of working out the question. 
Wo have noAv to show its relation to tho formula for r, the 
coofliciont of corrolatioii. Tho quantity ro-g/cr, in tlio equation 
y = rcr 2 tvla-i is tlio quantity evaluated by the diagram as ‘97. 
that is tho tangent of tho inclination of the lino to tlio axis of ai, 
whoro ages of husbands and wives are measured on the axes 
of s and y respoctivoly, and cr,, p-g are tho standard deviations 
for wives and husbands. If I had reduced all tho moasure- 
monts to tho standard of deviation boforohand, ')*, the coefficiont 
of correlation, would have boon tho tang'oiit of tlio iiieli nation 
of tho lino. Tho question which is the easier to work, decides 
which of tho two mothods you adopt. If you work it as 
I liavo done, witli tho samo scale of years vorbically and 

horizontally, you would havo to say that r = *97 x ™ , and 

CTa 

from tho iowor diagram that r — ■92x— j whonce r is the 

O' 1 

goomotrical moan botwoou *97 and ‘92, betweon tlio taiigoiits 
of tho inclinations of the line calculated on the two diltorout 

fT /‘9~2 

hypotheses, namely, ‘Qd'S; and — == = '974. 

SciJ U 

Now let us proceed to calculato r by tho formula — 

uo-io-g 

It is, of cour.so, a long business, and f shall not givo tlio work 
eomplotoly; I shall only indicate tho way in which it was done. 
'J'ho problem was to fine! that product for 010,000 pairs, which, 
of courso, is a prohibitive piece of work, and cannot bo done 
accurately, bocanso tho ages are not given oxcopt in 5 yearly 
limits. Wo jirocood by approximation. Thirst of all, I noglcot 
all tho niiinbors below 1,000; socondly, 1 assunio that the 
numbors loft aro at the middle of their respective groups. 
Thou I deal with the 60 or 70 numbers in tho table on p. 67 in 
the following way. Select a group, e.p., wives whoso ages are 



25 io 00; tlio inidfllo, 27i, is V2'C) yoars l)ol(nv Iho av(>viif>’(' 
of all wives; express iliis aiul uMior dovialioiis iti Um-iiis oI’ Mio 
standard doviatioiij 12*12; l2‘()-i- 1 2* 1 2= 'I liiit is f-lio 
term to 1)0 a])pliGd tliroiig’lioiit tliis groii]) of Inisbaiuls. ^ (m 
ilirougli a similar process for tlio u’/s. This (5 is a(- the middle 
of the group 20-25 years, namely; 22^, which is years 
bolow the average ago of all husbands, and that in l((U'nis el 
tlio standard deviations is about TO ; werh out the etln'r 
deviations, whicli ai'o in arithmetic progression; in the 
same way. d’hon multiply the unml)orH in the group; 0, <141, dl, 
7; 2; 1 each by its deviation; add; and multiply the sum by the 
deviation 1*04 for the group. 

lOxA^rmn : 
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Disliiiini lioiii 

No, 

I'KldllCt 

lUisbaiKlH 

AvoinHO 

20-2,5 

1 - 

(i 

- i)41 

25-30 


41) 


30-35 

“ •« 

31 

- 24, 'K 

36-<I0 

- *4 ^ ^ 

f- '03 go 

7 

- 2'H 

40-d5 

2 

1 '1 

46-50 



1 ‘4. 

Sum — i)(}'6 


Cy()iTospoiuliii;j piu’t of is — I'Od' of — I lOU'O 


Some of the resulting terms will be nogaldve nnless tin) 
correlation is considerable. i\dd these terms, and divide the 
sum by n (in this case 002), and the coo/Ticient of coi'i'ohitien 
*96 is obtained. In the method I suggest using wo do net 
deal with any large nnmbor.s at all. Mdio number is net far 
from the goomotric moan of *945 found Ijy the graphics niothed 

above. ALso -“=*90 (instead of '07 ns reckoned alsove), 
r X ~’02 (the same as above), r X — = I'O (instead of '071. 

CT] 0*1 ' ' 

Justification of tiiw Foudiula fok v. 

In the method of linding the formula for r on page 04, we 
used the method of least squares without examining its 
suitability, I will now give rcason.s, which have not; so far 



UH I kiioWj bo(Mi proviotmly ofTm'od, in favoiii' oC this method. 
If tlio wo lu'o dotiJin^’ wiUi Ix'long' to Llio normal curvo 

of on-oi'j tlioi’o iM no dilliciilty. If thoir mirvo of froqiionoy 
liiiH any otlior foniq ntill tho avoi-af^’OK of Holootod groupK, 
ropvoHoiiiod by tlio doin in DiiigramH XI and X U , aro govornod 
by normal (iiirvoH of orroi' (.mj page 50), Lot i}„ ya, . . . i},„, 
bo tho av(n-ago.s of groiipa of y\v, omilaining rospootivoly 
/I'l, Ih, . . . I<■‘m ittmiH, wlioHo HI valnos aro .c,, ii'a, . . . a;„,j m iluit 
Mio kr <r,\s wliloli, in tlio groniilng of fii'w adojited, aro in a small 
f^Toup wlioHo o-oid.i’o in av, liavo //-pairs whoso avorago is y,.. 
Lot //— n,r.-(-/j bo a lino wlilolt oontains tlio valnos of 1 / from 
will oil tho obsorvod //„ aro doviatiuns, ' Thou 

{lff, — (tir,y~~h) is a (piantity whoso frcapioncy-onrvo is 
I 

'■ mudulns is invorsoly proportional 

to ^//.v, /iv Imiiig tho mimbor in tho ir,., y,. group. The 
probability of Hindi dnviatlons ocourring togothor is (ns on 
]»ago 40) a maxinniinj when y<k,. {y-^axy — b)'^ is a niinimum. 
I<)(|iiating tho jiurtial dilVorontials of this sum with roforonco 
to ft and I) to j'.oro, and romomboring that ^/r,,y,.=0 = ^/i:,. 

II* tho iloviations aro inoaHiirod from tho general avoragOj wo 
Iiavo, as on ])Hgo 05, anil i)/iv ( f/ ir, .//,.) =0. UoncOj 

(t ~ hi/f, -4- liv/iViiv’'' Xtr//*^ whoro tho summation oxtonda 

ovor all tho iiairH, Thon, as hoforo, r= , 

(Ta n(T\cr2 

Oonsidoration of tho nature of tho formula will^ I think, load 
to the eomdnsion, that tho eoollioiont of eorrolation oalculatod 
by the formula is it good moiisuromont of eorrolation, whatever 
(MirvoH of froijnouoy you urn doiiling with ; and it is surprising 
Iniw very rapidly it small extent of oori’olation makes itself 
Jolt, oven when yon deal with ipiite a few examjdos. If 11 is 
only 20, you will soon (Ind whether (hero is eorrolation or 
not by this rormula, If you soloot gi'oujis whoro tlioro is no 
tiorrohilhni tho orilorion, disoussod liolow, sliows Unit tlio 
corrohition is not sigiiinoiint; hut dirootly tlioru is likuly to ho 
oorrohitioii botwooii tho groiqis, this formula for r shows it. 
'IMio eoollioiont of eorrolittion eau lie used thon in a very largo 
rogion of cuhoh in whioh it is voipiirod to test the oonnoction 
between two sorios of plimioimma. In particular, it oan bo 
used to doiddo whotlior two seriim of plionomona are ontiroly 
uiiconneetod or not, wliieJi snhjoot nocessitatos a proliminary 
treatment of tho natnro of series. 



MEASUREMENT OF SERIES. 


SIXTH LEGTUEE. 


Seeies. 

I PROPOSE to doEbl ill this Iccturo^ first of allj with series 
in general j and then -with the comparison of and correlation 
betwooii two series. By a series I understand a list of 
numerical events recorded at regular interval s, for example, 
recorded once every year. In representing' a series by a 
diagram wo measure time on the horizontal axis, and 
dividing it up into years, we erect an ordinate at the point 
corresponding' to each year, to reproseiit on a suitable scale 
the magnitude at that particular year. The question whother 
wo slioiild represent those magnitudes by dots or lines or 
rectangles is important, but it is decided on the principles 
discussed when we wore dealing ■with the representations of 
groups, and we need spend no more time on the analysis now. 
Perhaps the most natural way of ropresontiug such series is to 
erect a series of rectangles whoso areas are proportional to 
the successive magnitudes j but if wo leave the diagram in 
tliat form it will not Be very clear, it will be very ugly, and 
certainly this is not a neat way of linishing the representation. 
Tlio next stB^D is to draw a continuous lino to replace the 
rectangles; the commonest way of doing this is, to mark the 
middle points of the tops of the rootangles, and join those 
points by straight lines ; but this method is erroneous, for the 
same reason that it was erroneous in tlio representation of a 
group. Wo need to draw a continuous line so that the areas 



(Mnil4nn.i(l ill ro(!ln.nf>'ioH in i,lio livsb i)laco, and hy tlio 
(jurvnil l.ni,|m/,iiims iti ilm hcicoikI, hIuUI 1,u (.(juiil in ovory case; 
inil, (.Inn more corrccl, mirvii \h ])raoUoally coimndont with tho 
erroiiemiM lincHs it niakoa very lilUo diffovonco in 

jH’m'lac.o wliudi ol' t.lm two we draw; ihoy ivivo cortaiiily the 
Maine eiitieii.l imin-i'HMion. II*, liowevor, wo do roplaoo Uio 
rec.taiifvles i»y a contiimoUH line wo avo making an assumption 
wliieli iH^ Hoinetiiues jimtiliod, Imt wnnotiinos not; hy tho fact 
of drawing n, rentinnens line wo givo tho improsaion that tho 
event re|)reHente(l ia eetitiniially taking plaoo. This is correct 
in represeiitiitieim of hirlhs, doatlm, and marriages, and it is 
piLi’tly enrreet in re]) resenting inqiorts and exports hy curves 
lint it is net eerreet ill the rep rose ntatieu of ovonts which 
only oeoiir once each year. 'Plume aro details wliich aro easily 
analysed. 

()liAHHll''l CATION. 


'Pile .serii'H, or the curves which represent them, can ho 
(livide-d into threo main ehisses : periodic curves, symptomatic 
eiirves, and otliersj or instead of “ethers,” wo may say ciiiwcs 
with random fliictuatiims. Peri o< lie curves aro those whore 
siinilar lluetnatious riMinr at eijiial intervals of time, ns tho 
annual (liietiiatioii of teiiiporature recorded month hy month, 
Symptoinatii! eitrves are those which ha-ve a definite tondoncy np 
or down, a “ Hyin|)toin,” though short periods may ohscuro it, 
as the (hnitli rate since ltS7(). A eiirvo, which is noithor 
jiei’iodie net syniptomatie, may often he regarded as having 
riuidom (luctuidions about a stationary average, ns a curve 
ro])res(uil.ing; the annual averages of any niotoorologioal 
])li('iioninim, such as a-verngo tomporatiire year liy year. In 
tlie Diiigraiii Xlll’^nll four curves aro symptomatic; the first 
three ii-re dowiiwa-rds, luul tlie hist u]) wards for tho first 30 
years nint then lu'iirly level. 'Phe series I’oprosonted in 
Diagram XIV has apparently random lluetuiition.s. 'Phoso 
curves aro not periodic iti any strict sense. 


PlOlMOniO CUHVKS. 

'Pho first thing to discuss is, how to d Ison tangle the period 
from tlie symptom when a periodic curve is also symptomatic, 
or how to moiisure tho period if tho curve is not symptomatic. 
'I’ll ere is not Sjiaeo to ilisciiss tho matter comi)lotoly, and J want 
vatlior to imlieate tho methods, mill leave their considoratioii 

* Si-o ]). HJ , 



to t]iG reader. A curve often suggests two things : first, that there 
i.s a regular period, and, secondly, that there is a mo vein out 
apart from the period. Assume tliat wo are dealing' with 
monthly observations and an annual period, fi'o obtain the 
inovemeut apart from the period, take the avorag’OH oi the 
12 mouths of each year and mark them on tlio diagram ; 
these points would sliow the average rate for tlio year, when 
the readings of the vertical .scale have been adjusted. 13ut 
tliere is something arbitrary iu bogiuiiiug the year at tins l.st 
of January. The deaths, births, and marriag 0 .s, and any other 
figures wo deal with are probably indopoudont of that 
particular beginning of the year, and if we make coinpari.sons it 
may be better to take other periods to start with ; for inatanco, 
tho fiscal year begins on April the 5th. We want a oontinuoua 
representation, which we can obtain as follows : — First take 
the average from January 1st to JDecembor 31 si. ^'hon tho 
average from the Ist of February to January Slst, and so on 
until we got 12 dots every year. It is clear that tho ourvo 
through these points cannot have any sudden flnotuatioiis; tho 
curve so obtained shows the symptom when the period is 
eliminated. The theory underlying this method is quite 
simple. If we take any particular 12 months, wo shall 
include the whole influence of tho period, tho excess in ono 
part and the defect in another, and if wo avorago 
them wo shall pi’obably get the number which would 
have occurred if there had boon no poriod, and if 
the flow had been regular. It is approximate only, because 
tlie various small fluctuations will affect tho avorago, and it 
can be improved by smoothing the curve. If tho series is no(/ 
symptomatic the resulting smooth curyo should bo a horizontal 
straight line. 

Now, in order to measure the poriod ns apart from the 
symptom, the only method is to write down tho rates for tho 
50 Januaries which we may bo dealing with, and take tho 
arithmetical average, the mode, or the median of those j to 
repeat the process with the Februaries, and so on j ami then 
to represent the successive averages for the 12 months by a 
separate curve, which is best drawn with a base line througli 
the general average of all the data. Wo thus got such a 
curve as that given by the graph of ^ = sm from 0° to 300°. 
The justification of the method is simple. In tho 50 Januaries 
we include one January from each part in the symptomatic 



iMirvo. All Uh) o\(uihhoh diiu io Uio HynniUnnalic tondoucy will 
Ik) cDiiiiUiv-lmJiiuciitl by iJio (bd'octs, or willioiid to bo comitor- 
biihiiuuMl by Mio dob'olH, duo iiIho i,o Kyiupioiiiatiu iondoiioy. 
'riioy will only Uuid |,o Iw (UMinlor-bidiuuTd j for if wo take 
1,1)0 bO diLnii!i.ri((H wo intdudo iimoiif>' tlioin sonio oxiruordiiiary 
iiioniJiH, luid Hoiiio luotdili.s w1ioh(\ dovijiiioii I'roiii ilio annual 
nvonif^'o in quito Hmiill. Tito aooiiraoy wiLli which wo may 
o\l){'('l. b) tiio l-nio dimuirry roudiiij>‘ \n proportional to iho 
H<(iinro root of tint nmiiltor of timoM takoUj from tho theory of 
iivi'raf^'t'H (listtusHod altovtt. In carryin^t' out tho inothod, wo 
implicitly jihhuhio that tint civuho.s winch docido tlio .symptom 
tnid tlio ('iniH(tH wliich docido tlio period are indepondont, 
whiht /^’('intridly tlnty are not iudo])ondont. If there is an 
iiiciaiasiii^' di'atli rabt or an incireasin^' want of oniploymont 
lit tint Hiinii) timo that tint winter is ositcsually sovore tho one 
will ac-oontiiitio tho otlutr, It is vi'ry ea.sy to see how tlio 
riisult may ho nlloctod. Mupposo some industrial di.histor 
Idirows a />M'oat i>roportion out of work in August in one year, 
HO as to iiKToaso the ponton iago of an employ ed, wo will say 
to 50 , til on wliou taking tho average for ten yoar.s, that 
ligiiro aim 10 gives a rate of b ])or cent, in August whereas 
the excess imd nothing to do with tho fact that August was 
Mill iiioiitli conciinioil. If you take a Hidlicionb number of 
yoai's, however, those things will tend to eipializo one 
iiiiother, and if wo use the median instead of tlie arithmetic 
iivera-ge extraoi’diiui ry oucurreiutes have little effeet. J‘’er 
tills reason it is best, to estimate the jieried from the medians, 
111 the enil we sludl not get a smeoth enrve for our averages, 
and may have to smooth that by a ti'igononiotrical function, 
or liy seiiui otiior motliod. 

SYJiii'TeMA'Wo Hiflini'is. 

Wo will now dismiss tlio symptomatie curves j the top 
enrve in Diagram XI 1,1 (male death-rate) will do as well 
IIS any as an illustration, for the method of dealing with this 
eurve iqiplioH to a very great number (d‘ suoh curves. All 
HtatistioH represimting seololegical phenomena tliat I have 
bad ex]iei'i(moe of aro Hymptomatie. Perhaps in very rare 
eases you will liud no symptoni, Imt in general tbero is a 
Hynqitom ; however remotely cimiieoLod the ligures aro with 
Idle genera] progress of olvili/sition, yon will find there is 
Home Hyiiqitem up or down, or alternately up and down. In 



goneval we may aasume a symptom in all figures relating to 
human society. In dealing with such curves, wo somotimos 
want to exa-niine them in detail for a short period; but very 
often we are more concerned with the symptom, especially in 
forecasting events. In curve A in Diagram Xlli there are 
considerable and rapid fluctuations, but there is also distinct 
optical evidence of a fall in the 3’ate beginning between 1805 
and 1870. The causes which prodnecd the actual size of the 
ordinate are, of course, very man}'’, and it is impossible to 
draw tlie line between tliose which tend to make a gradual 
permtineiii change, and tlxoso which tend to make a sudden 
tomporary change. It is a question of degree and not of 
character, and for that reason alone it is impossible to give 
any theoretic solution for distinguishing tho symptom from 
tho small fluctuations, just as it is impossible to give any 
general .solution to the interpolation problem. "We have tlion 
to find an empirical solution, one that satisfies our immediate 
needs. It might appear best to draw a straight line, which 
on the whole shall cliflor from tlio observations as little as 
possible, and whicli could be determined by tho method of 
least squares ; this would assume a symptomatic tendency to 
equal increments or deoremonts in successive years. Or wo 
miglit assume a parabolic curve or logarithmic curve. A 
recent American writer lias a3.sumed that a certain series can 
be represented by = the compound interest equation, 
Dut I think in general there is no reason to assume any 
definite algebraic law. Ifiie solution I should sug’g'ost — it is 
a commonplace one — is similar to that I have just suggested for 
the removal of the period. It is most easily understood by 
an example. 

The figures in tho following table are from the Regislrar- 
GfoneraTs Returns, or are calculated from the Statistical 
Abstract. 
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Take the last group of figures, tho death rate o£ females. 
I take the average of the first five death rates, 20“ I, in 1845 
to 24' 4 in 1849, namely, 22*6, and place in the penultimal.o 
column at the middle of tho period, namely, tho yoar 1847. 
I begin again at the second yoar, 1846, and take tho average 
for 1846-1850, namely, 22’6 again, and place that at 1848, tho 
middle year of that period ; and so on for 46 succossivo 
periods. Then on the Diagram XIII, I have roprosentod that 
line of moving averages by tho dotted line running througli 
the conti nil oils line. I think it is clear that that lino offers one 
solution of the problem. In taking the average of any five 
years we are equally likely to include tho ups and downs of 
their fluctations. If there was a regular period, if tho 
fluctuations were five-yearly wo should remove them entirely 
in five years, it would bo the obvious time to take. If wo 
were dealing with figures referring to industry and the period 
was ten years, ten years would be the most appropriate length 
of time to average, including as it would one contribution from 
each i^art of the fluctuation. If there is no regular period 
thei’D is no rule to be given as to what number of years you 
shall take j it is a matter of convenience. If the five-yearly 
average gives you a curve with .sharp angles and apparently 
random fluctuations, increase the number of years. It is most 
convenient to work with an odd number of years, for tho 
middle of the period then coincides with the middle of one of 
the ^mai’s ; but, on the other hand, a period of ten years gives 
arithmetical facilities. This method may, I think, bo loft 
for consideration j I believe it will be seen that it offers a 
solution of the problem. To complete it, I recommend 
replacing the dotted line hy a regular curve drawn very near 
to it, smoothing out any little fluctuations whioli arc loft. 
A curve thus drawn would fall from 1847 to 1858, and 
rise for about seven years and then fall, fairly rapidly, to 
about 1882, and more slowly afterwards. In the nature of 
things we cannot fix exact years for the end of tho rise or fall. 
It is absolutely necessary to have some such method of 
measuring the symptom before you can base any argumont as 
to tho change in the quantity measured. That is very 
important. Dor example, the curve D, whicli represents 
imports, is a sharply fluctuating curve with a partial period. 
If, to take a particular date, we had in 1879 looked at the 
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previous two years only we should have though i Ihoro was a 
rapid fall in the average imports ; Lut if wo looked at ilio 
history of the phenomenon we should have scon that it 
appeared to he only part of a minor lluotuaiion^ and in 1882 
we should have seen that average imports had ho on stationary 
on the whole for eight years. 

It is not possible to say at the moment whether a fall is of 
a permanent nature or simply ono of those littlo lluotuations. 
which characterize the phenoinouou throughout the half 
century. Per instance^ hy 1908 wo can porliaps judge oL’ tho 
tendency in 1900, hut cannot judge of the currout year 
because we have not enough information. 

The deviations obtained by subtracting the iiistantauoous 
average from tho figures for each .year arc givon in tlio last 
column. The deviations for the first tliree grou]3.s of ligiirc.s 
in the table are calculated on a simihir method. 'Jdioso 
deviations should havo somo affinity to tho curvo of error. 
Great deviations should be rare compared with small deviations, 
and the occurrence of small and great deviations should havo 
some such relation as the occurrence of great and small 
deviations in the curve of error; but tho agrooinont is not 
likely to be close, for the deviations calculatod hero aro not 
independent one of the other ; they are bound logothcr by tlio 
fact that the same number is used in forming* fivo suooossivo 
averages, while the curve of error assumes that the things aro 
absolutely independent. 

Correlation between Series. 

That is a very rapid discussion of a rather wido subjoeb, 
but I must lead on to the corrolation between two sots of 
figures. If we Avere dealing Avith a curve Avilh no symptom 
and no period, foi* instance, tAVo sets of figures rolating to tho 
Aveather, aij, representing* the average temp ora turo, 

yi> yu ■ • • Vn representing the average Avind velocity, tho 
correlation betAveeii these tAvo should be calculatod as already 
described. If avo Avere dealing* Avith a periodic curve avo 
should replace tlie periodic curvo by its lino of avorago 
before comparing it Avifch another curve. If there is an 
irregular period, then I think Ave should proceed as if avo had 
a s^iptomatio curve Avitli no period. Of courso, any tAVO 
periodic curves Avitli tlie same period are correlated. Any tAVO 
secpiences of events Avhich are influenced by the annual 



in I'ini \v('iiili('r will a slronj^' dt'fi'roo of corrolation 
(piiii' iiiilt'pninlc'iiily ol’ lUiyUiiii^^ olao. Thai, is a (|iianUty 
whit'll in jp'in'i'iil will nol. Im worth laonHiiriiij^', but when wo 
j'oitio (o voiy ii'i'i'^pihii* pt'i'iod.s siioh ns tlioHo which wo find in 
ti'iido ntiitisticH, i(. is wiirtli imsisni'iu^' tho (loiTolfiiion ovon 
|ihi'()ii<>'h till' pi'i'iods; Ims'imiho it is not so (divioits, for iustaiicOj 
hut all tho (liu'tiiutioiiH of oxportH aro corrolaXod witli all tho 
lliiotiiiitions of iiiipoi'Is, and (hat tlui two togethor are 
i'oiTi'hiii'd with I'ho aimmut of i'iii|)loyin<'nt. 

A dilliciilty nrisi's in di si ling’ witli many onvm from tho 
flirt tlial. tho siirrossivi' iliwiiitions yi'ar by year iiro not 
iill.og'nlJior indi'poiidoiit. Many rnrvi'H which deal with 
soriolojpnil plirnoim'iia, havo Ihic.tiiatioiiH each of which 
('vtoinls over sovi'nil yi'ars, ho (hat a I'iso in ono year is moi’o 
ofton fol lowed liy a I'iso in tho aovt than by a fall. Othor 
oni’\ OH liiivi' till' opposilo ('hariii'toi'j that an oxcossin ono year is 
followi'd by di'l'orts in thi' othi'r; foi' iiistancOj if tlioro is a 
groat ( loath rato in ono year wo may oxpc'ot a oiiinparativoly 
Hiimll oiH' in I ho following; and this iibHi'iico of indopondonco 
shonhl 1 h' kopt in mind wla'ii wi' havo to baso urgnmonts on 
Iho roMiilting' corrolation. Hut aiiiirt from this, wo oonld treat 
tho doviiitioim fi'om tlu' moving lino of nvoragos as deviations 
whoso roi'i'olatioti wo ran fairly ralciihito. 

Tln't'o is a vi'ry pp’i'iit dill lenity in working out the correla- 
tion lirlavoi'ii Hyniplomal.io riii'W's. If wo do not take (he 
drviaihni from tho lino of itvi'ragrs, but take tlui deviations 
from Iho iivoi'ap;o for tho wlmlo HO years, any l,wo symiitomatie 
enrvi'H will show corrolation. If wo tako lavo things which 
art' nbsnhitoly disromioel.i'd, oxei'pli that tlii'y aro both 
phriioinoiia ai'isitig' in tho progress of sooie(,y, ami work out 
Iho roi'lliriont by tho striiiglitl'oi'ward riih', wo shall lind thore 

in If two onrvi's have sliort (limtiiations 

which an' rorrohil.od, but opposilo symptoniSj then owing l,o 
tho ay III pi om apart from tho Ibictnations there would be 
neg’iitivi' rorndalion, while owing to (ho lluetniitions apart 
from tho nymp(<om thoro would lie ])OHitivo corrohitiun j and 
when both lu'o lakon into account the itorrolation may bo 
poHtllvo, y.i'i’o, oi* lu'g'ativi'. It is thoroforo necessary to tieat 
tho sy iiipl oni 'irpiiratoly li'oin tho slioi’t IliK'tiiations. On tho 
wholi' tlioro is not imirli la'imlit in measuring ^the correlation 
coolliriont for tho syinpioins; wo should rather sinpily state that 
tho Hyiiiptoiii is say 15'* iipwiird in one case and 10*^ downward 



ill tlie otliei’. Tlie useful nieasuvemont ol tlio correlation 
between two sucli curves is not tliat o£ tho syinptoms_, but of 
tlie deviations. 

Another question which arises very often in a jiractical 
way is, whether we should compare tho deviation for tho 
whole figures, say imports, with the deviation foi* tho other, 
say the marriage rate, in the same year, or in tho next year 
Can we correlate the imports of 1847 witli tho marriag’O ralo 
of 1847, or should it he taken in comparison with that of 
1848 ? That question will often occur, especially botwoen 
marriage and birth rates. Mr. Hooker has suggested,* a 
suggestion which has been made indopondontly in America, 
that wo should work out the coefficients of corrohition on 
the hypotliesis of synchronism, and on al tomato hyjiothoaos 
that one event follows half or one year after the other, and 
see which correlation is the greatest. In this way wo should 
get a series of correlation coefficients according to tho 
dates we take. 

Before we proceed to measure correlation by mathematical 
formula) we should observe it purely graphicallyj and tho 
gTaphic representation of series will often suggest tho 
existence of correlation, which can then be moasuvocl by tho 
matliematical formula. The curves A and B in Diagram XIJI 
are obviously closely correlated. In tho curves B and C wo 
cannot decide from the figure whether there is correlation or 
not ; at any rate the evidence of correlation is not so groat. 
In tlio curves B and D, I do not think wo could decide from 
the figure as di'awn, tliough we might perhaps from a figui'o 
drawn in a different way, whether there was correlation or 
not. 

Let ns proceed to discuss how to put two curves down so 
as to g'et optical evidence as to whether they are correlated 
or not. Instead of measuring figures as in Diagram XLII 
measure as in Diagram XIY. Plot out tho floviations 
calculated on p. 79 above and below a base lino ropro- 
senting' zero; but before doing so it is necessary to choo.so 
the relative scales of the two quantities so as to havo a 
definite relation the one to the other. There is no obvious 
way of comparing pounds .sterling with one per thousand in 
the marriage rate. The way which naturally suggests itself 
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pp. 490'1. I tlunlv tlmt (Mr. Hookei' was also Llio flvsfc to publish a calculation 
or correlation based on deviations from n moving average, 




jind it is vGiy usofnl for mailing fcliu optical eviJoncu ol' 
correlation vivid, is to roprosont the sinnclarcl doviaiion for 
cacli group by unity on tlio vortical scale. Tlio Htaiulard 
deviation lor tlio doatli rate of males is ‘830 ; of foinaloH it 
is '803 j so VO roprosont tlio deviation ’830 for nmles and 
•803 lor fomalGS by tlio saino vortical lino. T£ vo voro 
doing tlio same thing for imports and male death rates, we 
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slioulcl represent by the same vertical scale '386 of a jioiind 
Rterling’ and '83 clcatli rate, 'fhia method has been ap]ilicd in 
Diagram XTV, for the comparison of linos A and B in 
Diagram XIII. I have pnt in the death rate for males, 
ropfcsented by the ssig’zag’ line, but I found I could not entor 
tlie deatli rate for females in the samo way and make the 
lilies distinct, and therefore have drawn short horizontal 
lines to show the death rate for females year by year ; the 
dots and Hnos representing* the two series are in nearly every 
year close together. Tlio optical evidence of correlation is 
very great indeed. 

The illustration taken is of two series where the cor- 
I'elation is nearly perfect; in less jicrfect cases we can g(it 
cvidonco by noticing whether fclio maxima and minima occur 
at the same dates in tlie two series. Bor example, if wo took 
the value of exports and percentage of nneniployod wo 
slionld find perhaps that the maximum of the one came at tlu^ 
SR.inc time as the niinimnm of the other tlii’ougliout, and that 
ivould give strong optical evidence of negative correlation, 
A. method of testing wliether thei*e was correlation or not, 
wliich would naturally suggest itself to anyone who ha.s a 
.small knowledge of probability, would be to see how often a 
]Jositive deviation of the one agreed with a positive deviation 
of tho other; how often like signs concurred and how ofti'ii 
unlike signs concurred. If we wrote down 50 + and — signs 
at random and another 60 along-side, the chances of getting 
various numbers of agreements are oa.sily calculated, and are 
ill fact the successive terms in the expansion of but 

that is not a good method, for it docs not take into account 
one of the most important considerations, wliothor a g'roat 
II actuation of the on© corresponds with a great fluctuation of 
tlie other or not. Wo should got equal evidence of correlation 
by this method when we liad a resemblance of this sort in 
two curves where great fluctuations corresponded with great 
and small Avith small throughout, and when the correspondonco 
was in sign only. fl'hoso things aro obviously not of tho 
same imporianco, and so the method of merely counting' signs 
will not take us very fur. 

We Avill, then, proceed by the method of calculating 
correlation described above. Referring now to tho table on 
p. 79, it should bo remarked that the method of evaluating 
tho value of imports changes at the year 1852; I had to 



approximate before that year from the values of the expoi'ts, 
because the fig urea given by the Board of Trade before and 
after that date arc calcnlatod on different methods^ and are 
nob comparable. Otherwise the figures uf total value of 
imports for home consumption to the United Kingdom are 
comparable. To ray mind the imports are more significant 
than the exporfs; and also it seems to me ahsnrd to add 
imports and exports j I do not thinlc you can add them 
together any more than you can add bread to butter. 1 have 
taken the imports only, and, without criticising the lig'ures in 
detail, I have divided by the gross population as given in the 
(Statistical Abstract, Thus we get tlio amount per head 
given in the first column in the table. I have only intended 
to work to tlie second jdace of decimahs. The deatli and 
marriage rates are taken from the Registrar-Generahs Report 
for 1896, which gives the figures for the previous 50 years. 
M'ho standard deviations given at the bottom of the table are 
obtained by taking the square root of the sum of the squares 
of the 46 deviations given, divided by 40, as in the ordinary 
formula fur standard deviation. The standard deviation is 
os.scntially an ahsolute quantity without sign. 

I have calculated the coefficients of correlation between 
groups I and 2 (imports and marriage rate), between groups 
I and 3 (imports and death rates), between 2 and 3 (marriage 
and death rates), and between 3 and 4 (death rates for males 
and females). T have intended to choose cases where, a, priori 
wo might expect small correlation, no correlation, and great 
correlation. A prioi'i we should expect correlation in the 
positive sense between imports and the marriage rate; not 
that increased imports cause an increase of the marriage rate, 
Imt the causes which produce prosperity are likely to have 
effect in increasing both imports and the marriage rate, the 
complexiis of causes which decide the two thing.s have 
something in common. The coefficient is 'Go. Tlie marriage 
i‘ate and death rate have presumably very little in common, 
Ono certainly could not say to start with whether an 
increasing death rate would synchronizie with an increasing 
or with a diminishing marriage rate. The correlation between 
the Iavo is —*19. The correlation botwceii the imports and 
the death rate is —'22. The correlation between the death 
rate for males and that for females is '99 ; it is practically 1, 



bill tlio immbGr 1 can only bo obtained if there ia an 
absolute proportion all through the soalo, which there is not in 
this ease. 


GrITISRION 01)’ SlGNJIi'ICANOR OF TIKC CoRRELA'L'lON 
CoiimCIENT. 


Now we are faeo to face with tlio quest ion, What do 
these numerical values ineauj and which of them are 
•sig’nificant V It is clear that soiiio such question urise.s^ 
])ecauso if we write down two .series absolutely at random 
and work out their formuhu the chances are very much 
against your obtaining zuru, and there are heavy odds against 
obtaining a small number. Now tho chaiico of obtaining a 
coefficient near zero increases with tiio number of terms, Lf 
wo liavo two series, iq, u> 2 . . . and 'I'l, fg . . .v^, measured 
from their averages, and wo select a group of 'u’s wlncli ai*o 
near to ono anotlier, tlio ids which will bo tlioir factor.s in 
forming tlio sum of tho products are equally likely to b(.> 
positive or negative ; if wo had an infiniio number of tliosi' 
deviations their sum will bo nothing; and tho .sum wouUl 
tend to zero if wo iiicrcasod the number of terms, tho actual 
deviation from zero being in inverse proportion to tho square 
root of n, tho number of terms. Honco the iiumbor of terin.s 
taken has much to do with tlio significance of the rosidting- 
cooflicient of correlation, and wc .should oxpoct that tho 


quantity 


—-rr- would outer into the moasuromont of tho 
vn 


signilicauce of tho coefKciont of correlation. It is a little 
difhcnlt to state and explain the measurement of tho critorion 
of the signillennoo ; but it is absolutoly necessary to make tho 
attempt. Of tho coofficieuts just given, the first and fourth 
avo found to bo signilicant, and tho second and tliirtl not, 
when tested by the ihoovetical criterion. 

Suppose wo take two correlated groups, and that tliore is, 
as a matter of fact, a definite value for the cooflicient of 
correlation ; and then suppose wo take 50 samples from oaoh, 
that is to say, 50 pairs of events, we .shall not iiaiurally obtain 
exactly the coefficient of correlation Unit belongs to 
the wliolc groups, fl'lie chances are against obtaining 
exactly that result. Now, tho deviations from tho actmil 
cooflicient of correlation which arc obt ained by taking samples 



fi'Ud finding’ tho correlation have a curve of frequency 
1 

y~ — .--e ''-j where ?/ is the probability that tho cocfiicient 
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obiained difforw by .0 from tho true coefiicient, and c, tho 

modulus = (l—r*^) wlicro r is the result obtained from 

tho sample groupj which consists of a pairs. The probable 

1 — 

error in tln.s curve of Oi'ror is ‘07 of —r- For example, in 

V7l 

tlie coeflRcient between imports and the marriage rate, 'jt— 40, 
tho calculated coefFicient of correlation is *05, and the probable 

error for its curvo of frequency is '67 x --- =: '050, 

V 46 


That is to say, from the calculation itself it is as likely as 
not that the actual coefficient is between '65 4- '050 and 
•65 — ’056. The cliaiice of the true coefficient being’ as much 
as tho modulus, namely '115, distant from tlie calculated '05 
is shown by the table of the error fnnotion to bo only 16 in 
100 ; the chance of it being so far from ’05 as to be actually 
zsero are infinitesimal, for in the cxirve of error the cases 
where the deviation is as much as six times the luoduliis are 
practically non-existent. So that we have overwliolining 
evidence, if our general principle of calculation is correct, of 
correlation between tho first and tho second columns, and the 
moat probable value of that correlation is two-thirds. Tn 
other words, the standard deviation of imports being £1187, 
and tho standard deviation of the marriage rato ’37, 
til© most prohahlo deviation of the marriage rate is 
-3-|- of *37 = ‘24, when wo find a deviation in imports of 
-|-£'387, and so on in proportion. This .statement should be 
Goimocied with tho graphic ineasnremcni of correlation 
discusaod on p. 70. In tho second case of correlation, that 
between imports and ilio male death rate, whore tho 
coofficiont is —’22 tho ]n'obahlo error by the method just 
doHoribed is '09. That is to say, our calculntlou means that it 
is as likoly as not, from our ovi donee, that tho correlation 
botwoon tlie.so two series is hetwoon — *13 and —•31; the 
chance that the real correlation is Kero or positive is quite 
porcoptiblo. Tho chaiico from the table of tho error 


^ See Porivaon, in Eoi/al Soo, Trans,, A. 17G, p. 265; nml poi'i'ection, in 
800 , J?ropeedn!ffSf Oct. 18ll», 1807; also Ynic, in Sfaitshcal Journal 
1807, p. 8-1,7. 



function that a negative deviation as groat as '22 should 
occur when the probable error is ‘09 is about one in ten. If 
wo took ten groups whicli had Kcro, or slight positive 
correlation, in one of thoso groups yon might expect to get 
such a rosult as —'22. Similarly, the ohanco that uncorrolatod 
g'l’oups of 4G pairs should give the coofRcicnt found 
between marriage and male death rate, namely, —'19, is one 
in six ; that is to say, once in six gTOiips which were not 
connected yon would obtain that aj^parent correlation. The 
chances tliat you obtain tho coefficient of correlation '99 from 
a random group is practically zero. That is to say, there is 
correlation lietwoen male and female death rates, and it is of 
such a nature that you could, given tho deviation of death 
rate of males in tho yoar, write down with very fair certainty 
the average death rato of females. For example, given that tho 
death rate of males was +'5 in excess of tho moving average, 
tliat then the most probable death rato of females would be 
"80 

.“qo ^ oxGOss of the average, and it is unlikely 

* Oo 


that any rato differing at all far from this will occur. 

Wo have thus found a way of measuring correlation, and 
of testing tho significauco of our moasurement, botwcon two 
groups and hctwcon two series. 'Tho method must bo used 
with discretion. I’ll ore is no time to discuss under what 
oircinnstancGs it is ajiplicable, nor the furtlier developments 
of the theory. 


OONOLUSION. 

In those lectures I have tried to indicate tho comraoii-sonso 
treatment of curve drawing and averages on the one hand, and 
the moi'o delicate and exact method of representing groups 
and serie.s by quantities based upon algebraic work on tlio 
other. Directly wo attempt to use tho latter methods, tho 
algebraic methods, wo find that wo are bound to make 
approximations tliat involve tho use of the ilicory of 
probability and tho theory of error, and 1 have therefore 
been compollGd to deal with these theories. When I have boon 
treating them ] have not attempted to promulgate any 
original opinions, I have only tried to illustrate principles, 
which aro already laid down, by now examples. Dui since 
the modorn shape of tho theories of probability and error is 



now, and involves some matters wliicli are still controversial — 
so far as matliematical reasoning can be controversial — I 
liave found it necessary to spend some little time in examining 
the foundations of tlie tlieorics in some detail. I have only 
been able to deal witli tlie beginnings of some of the difficult 
questions which arise, and I am sorry that for want of time I 
havo been comp ell od to leave out many illustrations of the 
practical utility of the methods; I have had to spend time on 
the theory rather than on the practice. My object will have 
been completely attained if I have succeeded in indicating 
the scope and the interest of the application of the theory of 
error, a subject which urgently needs the co-operation of 
serious students, alike to calculate experimental data, which 
are very much wanting, and to criticize, establish, and enlarge 
the body of theory. 
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